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Abstract 



The work in this thesis concerns the spUt of Einstein field equations (EFE's) with respect 
to nowhere-null hyper surf aces, the GR CaTichy and Initial Value problems (CP and IVP), 
the Canonical formulation of GR and its interpretation, and the Foundations of Relativity. 

I address Wheeler's question about the why of the form of the GR Hamiltonian constraint 

"from plausible first principles". I consider Hojman-Kuchar-Teitclboim's spacctimc-based 
first principles, and especially the new 3-space approach (TSA) first principles studied by 
Barbour, Foster, O Murchadha and myself. The latter are relational, and assume less struc- 
ture, but from these Dirac's procedure picks out GR as one of a few consistent possibilities. 
The alternative possibilities are Strong gravity theories and some new Conformal theories. 
The latter have privileged slicings similar to the maximal and constant mean curvature 
slicings of the Conformal IVP method. 

The plausibility of the TSA first principles are tested by coupling to fundamental matter. 
Yang-Mills theory works. I criticize the original form of the TSA since I find that tacit 
assumptions remain and Dirac fields are not permitted. However, comparison with Kuchaf 's 
hypersurface formalism allows me to argue that all the known fundamental matter fields can 
be incorporated into the TSA. The spacetime picture appears to possess more kinematics 
than strictly necessary for building Lagrangians for physically-realized fundamental matter 
fields. I debate whether space may be regarded as primary rather than spacetime. The 
emergence (or not) of the Special Relativity Principles and 4-d General Covariance in the 
various TSA alternatives is investigated, as is the Equivalence Principle, and the Problem 
of Time in Quantum Gravity. 

Further results concern Elimination versus Conformal IVP methods, the badness of the 
timelike split of the EFE's, and reinterpreting Embeddings and Braneworlds guided by CP 
and IVP knowledge. 



Overview 



Here I explain what is in this thesis and on which of my articles each topic is based. 

My Introduction serves the following purposes. In 1. 0-1 I summarize the standard founda- 
tions of classical physics for later comparison (mostly in Part A); these sections also serve 
to establish much notation for the thesis. In 1.2 I provide the split formulation of GR, to be 
used both in Part A and Part B. Much of this material was included in early sections of my 
publications. This section also includes some original material filling up some gaps in the 
older literature. 1.3 briefly provides the quantum physics and quantum gravity needed to 
understand what is being attempted in Part A, and how Part A and Part B fit into current 
physical thought. 

Part A: II considers work by Barbour, Bertotti, 6 Murchadha, Foster and myself {The 
3-space Approach), on the foundations of particle dynamics, gauge theory and especially 
Relativity, starting from relational first principles. I have carefully reworked this material 
and have added to it both from my papers [SI IHl and as an original contribution to this 
Thesis. Ill considers strong and conformal alternative theories of gravity that arise alongside 
GR in this 3-space approach, and is based on the pure gravity halves of my paper |Sj and 
of my collaboration with Barbour, Foster and O Murchadha IV furthermore includes 
bosonic matter and is based on the matter halves of the papers [31 and on my paper 
with Barbour |l(Jj . VI is based on my critical article on the 3-space approach [Hj. This 
comes to grips with what the 3-space approach's spatial principles really are, and how they 
may be related to Kuchaf's principles that presuppose spacetime. My modified 3-space 
principles are capable of accommodating spin-^ fermions and more. V.l and VII are original 
contributions of mine to this Thesis which update the 3-space approach to General Relativity 
and attempt to relate it to the standard approach's Relativity Principles and Principle of 
Equivalence. I hope to publish the new material in II, V.I, and VII as a new article f^- V.2 
provides more material on the conformal alternatives; this or related material in preparation 
may appear in a solo article as well as in a collaboration with Barbour, Foster, Kelleher 
and O Murchadha |12j . VIII is an incipient account of quantum gravitational issues which 
underly and motivate motivate much of Part A. I briefiy mentioned this in the conference 
proceedings [7j, but it remains work in progress. 

Part B: This is a separate, largely critical application of the split formulation of GR. I 
explain in B.l how the sideways analogue of the usual split, cases of which have recently 
been suggested in various guises, is causally and mathematically undesirable. This work 
began when Lidsey and I came across and used a sideways method 113) , followed by my 
criticism of it in my contribution to the article with Dahia, Lidsey and Romero '14, , further 
criticism in my preprint with Tavakol ^] , and yet further related work filling in some gaps 
in the early Initial Value Problem literature, which were most conveniently presented in 1.2. 
[1^1 also considers a multitude of other treatments of proper and sideways splits for GR-like 
braneworlds. Some of this material (in B2) was published in my letter with Tavakol jl5j . 
some of it will appear in my conference proceedings [Hi (a summary of B.3), and some of it 
will form the basis for further articles that Tavakol and I have in mind. 

Appendix C on elliptic equations contains further technical material useful in both Parts. 
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I Introduction 



Geometry 

"He who attempts natural philosophy without geometry is lost" Galileo Galilei |146j 
0.1 Geometry and physics 

During the development of classical Newtonian physics, it was assumed that there was an 
absolute time. This defined a sequence of simultaneities representing nature at each of its 
instants. Each simultaneity was a copy of the apparent Euclidean 3-geometry of space, con- 
taining a collection of particles (possibly constituting extended objects). This setting was 
also assumed for the study of further emergent branches of physics, both those with firm 
foundations in terms of particles (such as fluid mechanics and sound) and those in which the 
role of particles became increasingly less clear (such as gravitation, optics, electricity and 
magnetism) . Scientists began to favour rather the description of the latter in terms of myste- 
rious fields pervading space. Field theory led to a significant advance: Maxwell successfully 
showed electricity and magnetism to be two aspects of a unified physical phenomenon, elec- 
tromagnetism, and as a consequence that light is none other than electromagnetic waves. 
Contemporary experience strongly suggested that waves arise as excitations of some ma- 
terial medium, so space was presumably pervaded with some electromagnetic medium for 
which there was as yet no direct observational evidence. However, explanations in terms of 
this medium eventually turned out to be inconsistent with experimental optics, motivating 
Einstein to begin his quest to rid physics of all additional unverifiable structures and make 
it exclusively the province of the physical laws themselves. 

This first led him to special relativity (SR) jl2Uj in which the simultaneity concept was 
destroyed. Minkowski j2H9j argued that this made it appropriate to not treat space and 
time separately as 3-d and 1-d entities respectively, but rather as a single 4-d entity: space- 
time. Now already the geometry of Minkowski spacetime differs from the usual Euclidean 
geometry (see below). Furthermore, Einstein was not satisfied with SR (see 1.1.5), partly 
because, although his characteristic wish for it to incorporate not just electromagnetism 
but all the known laws of physics was almost completely successful, the gravitational field 
remained elusive. Einstein eventually realized that its incorporation required the geometry 
of spacetime to be not that of Minkowski, but rather a geometry curved by the presence of 
gravitating matter |121j . This led him jl22j toward general relativity (GR). 

That curved spacetime is usually perceived as the arena for physics including gravitation 
is the first reason why the development of modern physics relies heavily on the 19th century 
developments in geometry. Originally Euclidean geometry was thought to be the only 
geometry. However, Gauss discovered a means of having others. Simple examples of these 
have decidedly non-Euclidean properties which are all interpretable as manifestations of a 
property called curvature . Absence of curvature is flatness: Euclidean geometry is flat. 

I consider flrst a modern approach to (without loss of generality 3-d) Euclidean geometry 
which is well-adapted to the passage to curved geometry. Lengths and angles may be 

characterized in terms of the inner products (i.p's) between vectors x with components Xa, 

relative to some Cartesian basis: 



The matrix associated with this i.p is the Euclidean metric which can always be put into 

^In this thesis, I adopt the convention of barred indices for flat space(time) and unbarred indices for 
general space(time)s. In simultaneously treating higher- and lower-d space(time)s I use capital and lower- 
case indices respectively. Repeated indices are to be summed over, unless this is explicitly overruled. 



( length of x) = i/x • x = |x| , ( angle between x and y) = arccos 





the form 



with respect to some basis. The rigid motions in (rotations, translations and reflections) 
are those which preserve lengths and angles, and thus the i.p, and thus the Euclidean 
metric, and are thus termed isometries. The rotations form the special orthogonal group 
S0(3). Together with the translations, they form the (special) Euclidean group Eucl.2 The 
action of the generators of these on a 3- vector are 

Tk : X ^ X + k , (1) 



Rn '■ X — > X — X X . (2) 

One can similarly characterize lengths and angles in 4-d Minkowski spacetime in terms 
of another i.p that is associated with the Minkowski metric tj^q which can always be put 
into the form 
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with respect to some basis. Note crucially that whereas 6ac is positive-definite, r]^(j is 
indefinite. Thus whereas nonzero vectors in space always have positive norms \x\ , there 
are three types of nonzero vector in Minkowski spacetime: those with negative norm termed 
timelike, those with zero norm termed null and those with positive norm termed spacelike. 
The existence of these three types is central to the physical interpretation of SR (see 1.1.4). 
Furthermore these notions and their interpretation locally extend to GR (see 1.1.5). 

The rigid motions of 4-d Minkowski spacetime are those motions which leave invari- 
ant the Minkowski metric. They are standard rotations, boosts (rotations involving the 
time direction), space- and time-translations and space- and time-rotations. The rotations 
and boosts form the special Lorentz group S0(3, 1). Together with the space- and time- 
translations, these form the Poincare group.^ The action of the generators on Minkowskian 
4- vectors x = [t, x] is analogous to (^Hj). 

These notions of metric and isometry readily generalize to the study of simple curved 
geometries, such as those that Einstein very successfully chose to model the gravitational 
field. The (spacetime) metric itself replaces the single Newtonian scalar as the gravitational 
field in GR. 



Before discussing curved geometry, I give further reasons for its importance in modern 
physics. The dynamical study of the laws of physics themselves, regardless of whichever 
arena they are set in, leads to associated abstract curved spaces, which often help concep- 
tualize physical issues and solve specific physical problems. This thesis uses such abstract 
spaces (configuration spaces, phase spaces and relative configuration spaces) for a wide 
range of accepted physical theories. Both the arena aspect and the dynamical aspect play 
an important role in attempting to provide competing physical theories, often by extending 
or generalizing the geometrical structure of accepted theories. III. 2 and Part B provide 
two very different examples of this. Geometry may both be used to provide single all- 
encompassing formulations of the accepted laws of nature, and to attempt to supplant 
current formulations by more unified schemes. 

^These are all continuous transformations. Reflections are discrete. Whereas continuous transformations 
can be built up from infinitesimal transformations, discrete transformations are large (they have no such 
constituent pieces); the word 'special' here means that such are excluded. 

'^Time-reflections are usually undesirable. Space reflections are usually included to form the orthochronous 
Lorentz and Poincare groups. 



0.2 Intrinsic geometry of the 2-sphere 




a) b) c) d) 

Figure 1: 

Consider the surface of a 2-sphere. Some of its properties happen to be independent of the 
conventional visuahzation's surrounding . These are the intrinsic properties of the sphere 
itself, which would be noted by beings whose motions and perceptions are restricted to the 
surface of the sphere only. Geometers among such beings could in principle notice that 
although their world is locally E^, it has curiously non-Euclidean properties on sufficiently 
large scales. For example (Fig 1) a) if they fixed one end of a taut thread of sufficient length 
/ and traced out the locus of the other end, they would find that its circumference c < 2ttI. 
b) Angles made between three of the straightest possible lines might not add up to vr e.g 
octants contain 3 right angles, c) Gyroscopes moved between two fixed points may have 
a path-dependent final orientation of the precession axis, d) Initially-parallel straightest 
possible lines may converge. It turns out that all these symptoms can be characterized in 
terms of a single quantity: the intrinsic curvature. I next investigate this more formally, as 
required for general higher-d surfaces. 

0.3 Differential geometry 

Begin with as little structure as possible. Consider a real topological manifold M i.e a 
particular kind of topological space that is everywhere locally K"". Since one wishes to 
study tensorial differential equations that represent physical law, one would normally require 
the manifold to be sufficiently smooth. Tricks used are j3461 13631 1344j : 1) to work with 
manifolds that are locally and consider a collection of charts (mappings from an open 
set in M — > K") that cover the whole manifold, moreover for which the notion of nearby 
points in all the overlaps between charts is consistent. Then one can establish a differential 
structure on the manifold. 2) To introduce vectors on the manifold as the tangents to curves 
(mappings I — > M for I an interval of 3) To then compose these maps to make use of 
the ordinary calculus of W to 5?*^, and moreover show that all the notions involved are 
chart-indep endent . 

4) To use the machinery of linear algebra to form the dual vector and all the higher- 
rank tensors on the manifold. From the perspective here, a dual vector at a point p on 
the manifold is a linear map Tp(M) — > where Tp(M) is the tangent space at p, a vector 
at p is a linear map T*(M) — > where T*{M) is the cotangent space at p (the dual of 
the tangent space), and a rank (k, I) tensor at p is a multilinear map from the product 
of k copies of T*{M) and / copies of Tp(M) to SR. A collection of vectors, one at each p 
G M, constitutes a vector field over M, and tensor fields are similarly defined. The more 
old-fashioned equivalent definition of a (A;, /) tensor is in terms of components: that these 

transform according to T'^'^-^'k ., = V''^i^...V'ki^U^ ...V^ jiT^^---'''' 

(3) 

in passing between unprimed and primed coordinate systems, where Uji = is the 
general curvilinear coordinate transformation. 



5) It is not straightforward to introduce derivatives acting on vector fields fields because 
derivatives involve taking the limit of the difference between vectors at different points. So 
then these vectors belong to different tangent spaces. Whereas in K one can just move the 
vectors to the same point, this is not a straightforward procedure on a curved surface [c.f 
fig Ic)]. The usual partial derivation is undesirable since it does not map tensors to tensors. 
However, it suffices to construct such a derivation acting on vectors and acting trivially on 
scalars, since then one can easily obtain its action on all the other tensors by the Leibniz 
rule. 

Lie derivation does map tensors to tensors, but is directional because it is with respect to 
some additional vector field along which the tensors are dragged. I denote this derivative 
by £^ . It is established from dragging first principles j346j to act on scalars and vectors as 

£^S = eOiS , (4) 



£^V'' = ediV" - diCV' . (5) 

To have a non-directional derivative that maps tensors to tensors, one corrects the partial 
derivative by introducing an extra structure: the affine connection, which is a rank (1, 2) 
non-tensorial object with components denoted by V^bc, transforming not like Q but rather 

according to j'k' = Lf V jiV' ^iT^ j^+V' i-^V jidjV k' ■ 

(6) 

The non-tensoriality of the connection compensates for that of the partial derivative so that 
tensors are mapped to tensors. The derivative obtained thus is the covariant derivative. I 
denote it by D or V. It is just the partial derivative when acting on scalars, but is 

Dav' = dav'' + T\,V'' (7) 

when acting on vectors. For later use, note that Lie derivatives can be re-expressed in terms 
of covariant derivatives £^V'' = CDiV-Di^^V' 

(8) 

when the manifold happens to have affine structure. 




Figure 2: Two different ways of transporting a vector io°. u, v parameterize 2 arbitrary curves with 
tangents a;" and j/" respectively. 

A consequence of the non-tensorial transformation law of the afhne connection is that 
coordinates may be found in which the affine connection is zero at any particular point, 
which is crucial in the development of GR. The affine connection may be interpreted as 
giving a notion of straightest possible transport of vectors along curves. Now, one finds 
that such parallel transport along two paths generally depends on the order in which the 
two paths are traversed (Fig 2). One finds that a combination of derivatives and squares of 
the affine connection, the Riemann curvature tensor. 



jya — o pa o -pa , pe pa pe pa 

-K bed = Cci bd — Crfi bc + >- bd>- ec — J- bc>- ed , 



(9) 



can be associated with this property of the transport of a vector w"". 

R\c.v.'-V = . J^) ■ (10) 



Instead one can define the Riemann curvature tensor from the 'Ricci lemma' 

2V[aVb]W' = Wdabw'' . (11) 

In afline geometry the 'locally straightest paths' or affine geodesies may be parameterized 



in terms of some v so as to have the form = x'^DaX^ = x'^+T^ jkX^ x'' = , 

Du 

(12) 

where a dot denotes If one considers two neighbouring geodesies with initially-parallel 
tangent vectors x"", then one can arrive at the Riemann curvature tensor by considering the 
relative acceleration of two neighbouring geodesies with connecting vector 2" (fig 3) 



i^^edx'x^z'^ . (13) 



Note that this is a nonlocal effect. 




Figure 3: For 2 nearby geodesies 71, 72 in a congruence, each parameterized by u, the connecting vector 
z is the tangent to the curve connecting equal-;/ points. 



It is necessary to exclude torsion T = ^ [be] = 

(14) 

for formulae ()1U|) . H11() and (|13() to hold and for it to be true that it is necessarily curvature 
that underlies these symptoms (since the torsion corrections to the formulae by themselves 
produce similar symptoms). 

The Riemann tensor obeys R°'bcd = —R"'bdc 

(15) 



and the first Bianchi identity R°'[bcd] = j 

(16) 



whereas its derivatives are related by the (second) Bianchi identity: [e\R°' b\cd] — • 

(17) 



The Ricci tensor is Rtd = R"'bad ■, 

(18) 



which is symmetric: 



Rab — Rba 



(19) 



Another structure is required in order to have a notion of distance (in the spacetime 
or space apphcation this is desirable to model how observers perform measurements of 
length-duration). A metric tensor gab is brought in for this purpose. It is usually taken to 
be symmetric, nondegenerate and a function of the coordinates alone. Then {M,gab) is a 
(semi-)Riemannian manifold.^ I denote the determinant of the metric by g and its inverse 

ri^2 

by g"'''. The metric defines length along a path with tangent i;"^ by s = / ^/ga},i°^(lv 

and (hyper)volume by ^ ~ J ^^^"^"^ ' 

(Semi-)Riemannian geometry has a metric connection ( Christoffel symbol) 

I l^lg'^HdcgM + dbged-ddgbc) (20) 

6 c J 2 

with corresponding metric geodesies (paths locally of extremal length) parameterizable as 

r + l |xV = 0. (21) 
In (semi-)Riemannian geometry all intrinsic properties follow from the metric: one assumes 



that the affine connection is the metric connection: < > = r"i,r , 

(22) 

so then geodesies and the Riemann tensor are expressible in terms of the metric. Einstein 
assumed this simple geometry for GR, which has so far been consistent with observation. If 
however one does not assume the above equality, then the difference of the two connections 
would constitute an additional tensor generally composed of torsion and nonmetricity parts. 
Apart from allowing these, other ways of having more complicated geometry are for the 
metric to be non-symmetric, or degenerate, or to depend on the velocities as well as the 
coordinates (as in Finslerian geometry). These will not be considered as options for the 
geometrization of space(time), but the last two occur in my dynamical study. 

The metric and its inverse may be used to lower and raise indices on other tensors. Thus 
one can define the Riemann tensor with all its indices downstairs by 

Rebcd = 9eaR"'bcd (23) 



which has the additional symmetry property Rabcd = R-cdab 



One can also now obtain the Ricci scalar R = g Rbd 



(24) 



(25) 



^Genuine Riemannian metrics arc those which are positive-definite; those which are indefinite are termed 
semi-Riernannian. I also refer to tfiese as Euclidean- and Lorentzian-signature metrics. I use qab and 
hab for metrics when simultaneously treating higher- and lower-d manifolds, with corresponding covariant 
derivatives Va and Da- Higher-d objects are generally distinguished with checks. All the formulae of this 
section should bo transcribed accordingly prior to their application. Round brackets surrounding more than 
one index of any type denote symmetrization and square brackets denote antisymmetrization; indices which 
are not part of this (anti)symmetrization are set between vertical lines. 



The curvature tensors contain the following numbers of independent pieces of information: 



#Rabcd — ^ 1 #Rab — ^ ^ ^ > #R — 1 • (26) 

This establishes which tensor suffices to describe intrinsic curvature in each dimension: R 
suffices in 2-d, Rab in 3-d and all of R°'bcd is required in all higher-d. It also establishes 
whether the numbers of equations and unknowns make sense in different possible physical 
theories based on such geometrical objects. The (irreducible) part of R°'bcd not in Rab is 

2 2 
the Weyl tensor Wbcd = R^bcd - [cRd\b - 9b[cRd]l - _ -^^^^ _ '^"[dgc]fe-R , 

(27) 

which inherits all the symmetry properties of the Riemann tensor. 

The Einstein tensor (used in the construction of the Einstein field equations), 

R 

Gab = Rab — -^9ab (28) 

has the important properties of being divergenceless by the contracted Bianchi identity 

y'Gab = , (29) 
and a symmetric (0, 2) tensor like the metric. 



App O.A Densities and Integration 



A (n, m) tensor density of weight w is an object which transforms like 

T'^-'-l^,,:,^ = (^/ditL)-L'^\,...L'^",„L^^^^...L''^g^r'^--",,..,„ (30) 

I denote w = 1 and w = —1 objects by overlines and underlines respectively. In particular, 
the square root of the metric determinant is a scalar density of weight 1. I need the following 



ingredients to build up derivatives of densities: £^Vh = VhD^'^a 

(31) 

Da^/h = . (32) 

For integration to make sense on a manifold, integrands must be weight-1 scalars (since 
it does not make sense to add higher-rank tensors at distinct points). Furthermore, (|32|) 

allow, one to have a divergence theorem / TdE. = / B Jd« 

JdV Jv 

for a"' a (1, 0) vector density of weight 1 and where dS^ and d^l are the obvious (hy- 
per)surface and (hyper)volume elements. 



1 Classical physics 
1.1 Newtonian physics 

Newton j279j gave us three fundamental laws of motion. 

Nl Every body continues in its state of rest, or of uniform motion in a right line unless it 
is compelled to change that state by forces impressed upon it. 

N2 The change of motion is proportional to the motive force impressed; and is made in the 
direction of the right line in which that force is impressed. 



N3 To every action there is always opposed an equal reaction. 

In the modern notation adopted, the position of a Newtonian particle is x, and its 
velocity is x = ^ where t is Newton's notion of time (see below). Then N2 reads 

dp ^ ^ 

Impressed force F = — , for p the momentum . (33) 

Newton's momentum is p = mx; in most applications the mass m of the particle is taken 
to be constant. From H33|) it follows that in the absence of the action of external forces the 
momentum p is conserved. 

Newton explained j28Uj that his laws are to be interpreted as occurring in his all- 
pervading, similar and immovable absolute space while his external absolute time flows. 

There are a number of frames of reference in which Nl holds, called inertial frames, 
which are at rest in absolute space or moving uniformly through it along a right line. These 
inertial frames are inter-related by the transformations of Galilean relativity : 

X — >x'=x + vi + k, (34) 



t — >t + l , (35) 

for constant v, k and /. Non-inertial observers perceive additional fictitious forces. Note 
that (|35|) is a statement of existence of absolute time up to choice of time origin. Newton's 
laws are not necessarily Galileo- invariant. This would further require jSH] 
'N4' The masses, and the strengths of the forces are independent of the motion of the centre 
of mass of the system of particles relative to absolute space. 

The immovable external character of absolute space and time is abhorred by relational- 
ists, who maintain that the laws of physics should rather be based on relative notions alone 
(see 1.1.5 and II- VIII). 

Newton also provided an explicit form for a particular universal force: 
Newton's law of gravitation: The gravitational force on a particle of mass W'(i) at 
position X(]^-) due to a particle of mass m(2) at position X(2) is given by^ 



g _ "■■Hi)'"(2) . s 

|E(1)(2)I 



^fi){2) written as F^i)(2) = ~^{i)(^<P(i)(2) ^or 0(i)(2) = | the gravitational 

potential at x^;^) due to the particle of mass m(^2) ??(2)- Newtonian gravity is linear, so 
one can build up by superposition the total gravitational force at x due to all the particles 
in the universe, F^(x) = —[md(f)]{x). Combining this with (|33j) . one obtains the equation of 
motion for a (constant-mass) particle (in the absence of any other type of forces): 

x=-[5(/>](x) . (37) 

Consider this for two neighbouring particles at positions x and x + Ax. Then subtracting 
and by the definition of derivative one obtains the tidal equation 

AX = -d{d ■ ax) (38) 

for the relative acceleration of the two particles. 

Finally, one requires a field equation describing how sources produce gravitation. In 

differential form, for 

the mass density, it is Poisson's law JD^cf) — AttG . 

(39) 

Newtonian gravity is very successful at describing the solar system: Kepler's laws and 
perturbations due to planet-planet interactions follow from it. However, a small anomalous 
residual perihelion precession of Mercury was eventually observed. 

^Units: in 1.1.1-5 I keep G (Newton's constant), c (the speed of light) and h (Planck's constant) explicit. 
I then set c — h — 1, SirG = 1. 

1 use (i)-indices to run over particle labels, and t(i)(j) = X^jj — X(j). 



1.2 Principles of dynamics 



1.2.1 Configuration space and the Euler— Lagrange equations 

Theorists can choose to describe the position of a particle in (absolute) Euclidean 3-space 
by 3 curvilinear coordinates. Convenience and not any physical reality underlies which 
particular choice. Mechanical systems are usually taken to be second order, so that the 
initial position of the particle does not suffice to determine the motion. One requires also 
e.g the initial velocity or the initial momentum. For n particles in E^, naively one requires 
the prescription of 3n coordinates to describe their positions. But, the particles may not 
be free to move in all possible ways, e.g some of them could be attached via strings, springs 
or rods. Such constitute constrained mechanical systems, describable in terms of less than 
3n independent coordinates, ^a-^ 

The space of coordinates that describe a system is its configuration space Q. The motion 
of the whole system is represented by a curve in this configuration space. Sometimes one 
may attempt to work with a Q that is larger than necessary e.g a 3n-d Q for a n-particle 
system with p constraints. However, some aspects of the system would be better understood 
if one were able to take into account the constraints and pass to a (3n - p)-d Qreduced- 

Whereas one may attempt to study such particle systems directly using Newton's laws, 
this may be cumbersome and requires knowledge of all the forces acting at each point in 
the system. A method which is often more practical, and which extends to field theory, 
is that based on energy considerations, as formalized by Lagrange, Euler and others j246j . 
One computes the kinetic energy T of the system and the potential energy V. Then one 
forms the Lagrangian L = T - V, knowledge of which permits one to write down a set of 
equations of motion equivalent to Newton's. These equations arise by declaring the standard 
prescription of the calculus of variations: the equations of motion are such that the action 
I = / dtL is stationary with respect to q/\ when one considers the true motion between two 
particular fixed endpoints ei and 62 together with the set of varied paths (subject to the 
same fixed endpoints) about this motion. Then for a second-order mechanical system. 



= 6\= dt t^Sqa + -tt-Sqa = / dt 

Jei \9qA OQA ) Jei 



dl d / dl 



dqA dt \dqA 



2A + 



w^oqA 



62 



ei 

(40) 

by parts. Now note that the last term vanishes because the endpoints have been taken to 
be fixed. As 5qA is an arbitrary variation, the laws of motion in their differential form 

d / dl\ dl 

follow: the Euler-Lagrange equations (ELE's) — — — — — — = . 

dt ydqAj dqA 

(41) 

These equations are easier to integrate for two special cases of coordinate. Thus one 
main theme in the principles of dynamics is to judiciously choose a coordinate system with 
as many special case coordinates as possible. This extends the well-known exploitation of 
conservation of momentum and angular momentum. The special cases are 

1) Lagrange multipliers: if L is independent of (jn, the corresponding ELE is 

2) Cyclic coordinates: if L is independent of qn, the corresponding ELE is 

— — = constant . (43) 
dqn 

1.2.2 Legendre transformations, Jacobi's principle and Hamiltonian dynamics 

®My capital sans-serif indices are general indices, and my < i >-indices run over indexing sets. I use 
capital sans-serif letters for objects used in the principles of dynamics. 



Suppose one has a function F(a;A : A = 1 to m]y\isi) and one wishes to use za = as 
variables in place of the xa- To obtain equivalent equations of motion, one has to pass 
to a function G = xa-^a ~ F(xA,yw) which may always be written as G{z/\,yY^/). This is 
the Legendre transformation. It is symmetric between xa and zj\: if one defines xa = ■§^, 
the passage is to F(xA,yw) = xaza — G{zA,y\N)- In particular, if the function one has is a 
Lagrangian L(gA,9A)) one may wish to use some of the (generalized) conjugate momenta 

as variables in place of the corresponding velocities. 

A first example of Legendre transformation occurs in Routhian reduction: given a La- 
grangian with cyclic coordinate Q, \-{qB',QB,Q)^ then P = — constant , so that one 

may pass from L to the Routhian R{qB', Qb) = ^{qb', QBi Q) — PQ- 

A second example, is the formulation of Jacobi's principle j246j . If one has a L(gA;4A) 
which does not depend explicitly on time, t. Then one can express I = J dtL as 

1= / dT[t'l{qA;q'^,t')] (45) 



i.e r-parameterize the action by adjoining time to the positions, where ' = Now as a 
consequence of the explicit t-independence of the original Lagrangian, t is a cyclic coordinate 
in (|45j) so (as a special case of Routhian reduction) one may pass to 

dV. 

Lj(9a; ^a) = "-(^a; ^a)*' - P*^' ^r _ = = -E , constant . 

(46) 

Minimization of Ij = / drLj is Jacobi's principle. It is a geodesic principle since its use 
reduces the problem of motion to the problem of finding the geodesies associated with some 
geometry. I particularly use the special case of a mechanical system which is homogeneous 

quadratic in its velocities: L = T— V = -M^^q/\qB—\/{qA) , 

(47) 

where M^^ is the 'kinetic matrix'. Then, I = j drt' {^^M"^^ {qf^jq^qB - ^{iS^ ■ 



Using (llHll as an equation to eliminate t', E = — ■ps^^^q'hqB'^'^i^lh) =^ t' — 

zt'^ V E — V 



Thus 



(48) 

for dcj^ = MAedg^dg^ the line element of the configuration space geometry. Thus in this 
case use of Jacobi's principle translates the problem of finding the motion of the mechanical 
system into the well-defined, well-studied problem of finding the geodesies of a Riemannian 
geometry with line element da'^ related to dcx^ by the conformal transformation 
da^ = 2(E-V)dcj2. 

Further examples of Legendre transformation include the passage to first-order form in 
VI, in addition to well-known moves in thermodynamics and QM. There is also the passage 

from the Lagrangian to the Hamiltonian ^{q/KiP^] = P^q/\~^iqAj 'ZAi t) 

which makes use of all the conjugate momenta. The equations of motion are now 

. 5H .A 9H 

Hamilton s equations q^ = — — r- , p = — 7^ — . 

op'^ oq/K 

(49) 



Instead of working in configuration space Q, one now works in the (naively 6n-d) phase 
space (Q, P) where P is the set of the momenta p*^*^. There are canonical transformations 
on phase space which generally mix up what are the positions and what are the momenta: 

qA,p'' ^Qa,P^ . (50) 

Whereas configuration space has a (commonly Riemannian) geometry on it with a symmet- 
ric metric Mab as its preserved object, phase space has a preserved object p^dq/i, up to a 
complete differential. This gives rise to an antisymmetric symplectic structure. This is most 
usefully phrased via the introduction of Poisson brackets 

fff ^\ ( Am _ df dg dg df , 
{f{,A,P ),9{,^,P )} = 5^5^-a^5^' (51) 

whereupon the equations of motion take the form {qa, H} = qa , {p^, = p^ . 

1.2.3 Dirac's generalized Hamiltonian dynamics 



This subsection provides a systematic means of studying constrained systems. The field- 
theoretical version of this is of particular importance in this thesis. I use #A to denote 
the number of degrees of freedom (d.o.f's) in the set indexed by A. If not all the conjugate 
momenta p^ = can be inverted to give the qa in terms of the p^, then the theory has 

primary constraints Cp{qfi,,p^) = solely by virtue of the form of L. As Dirac noted jll2j . 
in such theory described by a Hamiltonian [-\{qfi„p^) could just as well be 

described by a Hamiltonian H'^°*''^ = H+NpC^ , 

(52) 

for arbitrary functions A'^p. Moreover, one needs to check that the primary constraints and 
any further secondary constraints CQ{qi\,p^) (obtained as true variational equations Cq = 0) 
are propagated by the evolution equations. If they are, then the constraint algebra indexed 
by l<i> = PU G closes, and a classically-consistent theory is obtained. This happens when 
vanishes either by virtue of the ELE's alone or additionally by virtue of the vanishing 
of C|^-^^ . Following Dirac, I denote this weak vanishing by C\^^^ ~ 0. 

If C\^^^ does not vanish weakly, then it must contain further independently- vanishing 
expressions Cs^^y{qh^P^) in order for the theory to be consistent. One must then enlarge 
the indexing set to l<2> = l<i> US<i> and see if C|^2> ^ 0. In principle, this becomes an 
iterative process by which one may build toward a full constraint algebra indexed by l<final> 
by successive enlargements l<i+i> = l<i>US<i>- 

In practice, however, the process cannot continue for many steps since #l<i+i> > #l<i>, 
#A is a small number (for field theories, a small number per space point), and it is important 
for true number of d.o.f's to satisfy #A — #l<finai> > if one is seeking nontrivial theories. 
It should be emphasized that there is no guarantee that a given Lagrangian corresponds to 
any consistent system. The perception of the first point post-dates Dirac (see e.g |354[ 11(1] . 
and also my comment in V), whereas Dirac's book contains a famously simple example of 
the second point: L = g gives the ELE 1 = 0. 

Finally, constraints are first-class if they have vanishing Poisson brackets with all the 
other constraints. Otherwise, they are of second-class. If there are second-class constraints, 



one may define the Dirac bracket {fiOY = if id} ~ {/) Cd}({Cd; Cq/}) {CQi,g} 

(53) 

where D and D' index the smallest possible set of second-class constraints with respect to 
the original Poisson bracket. The Dirac bracket is built so that if one uses it in place of 
the Poisson bracket, there are no second-class constraints. Thus one may always choose to 
work with first-class constraints alone. 



1.2.4 Motivation for use of principles of dynamics 



The principles of dynamics developed for classical mechanics have turned out to be applica- 
ble and very useful in the study of relativistic field theory to which the rest of this chapter 
is dedicated. In particular, the ease of derivation of Einstein's field equations of GR using 
the principles of dynamics (See 1.1.6 and 1.2.4-5) made physicists appreciate the value of 
these methods. II- VII contain many further uses of the principles of dynamics in the study 
of GR. 

In the development of QM (see 1.3) the Hamiltonian plays a central role. Quantization 
itself may be naively perceived as the passage from classical Poisson brackets to commutators 
of the corresponding quantum-mechanical operators: 

{f,9}-^^\[f,9]\^^{f9-9f) . (54) 
Finally, if one replaces by ^ in H , one obtains the Hamilton-Jacobi equation 

to be solved as a p.d.e for the as-yet undetermined Jacobi's principal function S. If the 
system has constraints (see e.g |182) ) Cx{qA',P^), (|55|) would be supplemented by 

Cx (qa; 1^1 = (56) 



dQA 

The Hamilton-Jacobi formulation is close to the semiclassical approximation to QM. 
1.3 Electromagnetism 

Many sorts of forces appear in Newtonian physics. Physicists have tried to subdivide these 
into a few fundamental forces and consequent forces. The gravitational force ()36() is regarded 
as a fundamental force; the theories of the electric and magnetic forces discussed below are 
also regarded as fundamental forces, in terms of which many other forces such as friction 
and chemical bonding are to be understood. 
The electric and magnetic force laws are 

e _ 1 g(l)g(2) ^ .K7^ 

F(i){2) = 13 ^^(1) dX(2) X r(i)(2) . (58) 

l-(l)(2) I 

(|57() is for the force on a charge at position x^;^) due to a charge g(2) position X(2). (jSS)) 
is for the force on a wire element at position x^j^^ with current /(i) due to a wire element at 
position X(2) with current 1(2)- eo ^-nd /xq are respectively the permettivity and permeability 
in vacuo. 

Just as in Newtonian gravity, it is useful to think in terms of fields. One often uses 
f = gE for E the electric field, and F™ = Jdx x B for B the magnetic field. Some of the 
field equations are then, in traditional integral form: 



eo 



do. 



E • dS = / Pcd^x Gauss's Law , (59) 



* B • dx = /Uolenciosed by wire loop Ampere's Law , (60) 

J wire loop 



where p^. is electric charge density. In the presence of changing magnetic fields, (|57jl was 
found no longer to hold; rather one has 

i E-dx = -^/" B-dS Faraday's Law , (61) 

./wire loop J do, 

which suggests that electricity and magnetism are not independent concepts. 

Provided that Ampere's law in fact also includes a displacement current correction, 
Maxwell found that he could consistently explain electricity and magnetism as aspects of a 
single phenomenon: electromagnetism . The differential form of the equations now reads 

9 • E = ^ Gauss' Law , (62) 

9 X E = — B Faraday's Law , (63) 

9 • B = non-existence of magnetic monopoles , (64) 

d X B = //o(j^ + coE) Ampere's law with displacement current , (65) 

where is the electric current. The displacement current was then experimentally verified. 
N.B it is often convenient to rewrite Maxwell's equations in terms of a magnetic potential 
A and an electric potential s.t. B = 5 x A and E = —d^ — A. 

Here are some useful comments for later. First, from this unification of electricity and 
magnetism into electromagnetism, Maxwell unexpectedly deduced that light is nothing but 
electromagnetic radiation [by combining the above equations to form two wave equations 
whose propagation speed is the speed of light c = (eo//o)~^]- 

Second, whereas (|63() [and are the usual sort of evolutionary laws, H62|) [and (jHH)] 
are constraints i.e instantaneous laws about the permissible configurations. 

Third, Maxwell's equations are field equations, determining how sources produce elec- 
tromagnetic fields. To have a full grasp of electromagnetism, one also requires a law to 
compare the motion of (constant mass) charged and uncharged particles in the presence of 

an electromagnetic field: the Lorentz force law x = — (E +Y x B) • 

m 

(66) 

Fourth, if Maxwell's equations provide an account for light waves, then the contemporary 
experience with other types of waves suggested that there should be an associated medium, 
the luminiferous Aether, of which light is an excitation. Fifth, Maxwell's equations are 
not invariant under the Galilean transformations l|34( I35j ) . Instead, one may define a new 
group of transformations which leave Maxwell's equations invariant: the Lorentz group, 
most conveniently described as follows. It consists of rotations and boosts. 



t — > t' = 7[t - vx/c^] , 7 = - {v/cf (67) 

X — > x' = 7(x — vt) (68) 

y — ^y' = y (69) 

z — > z' = z (70) 

is the boost for passing from a frame at rest to a frame moving with constant velocity v in 
the X direction. For other directions of motion, rotate the axes, then apply the above and 
then rotate back. 



1.4 Special relativity 

The last two issues above have profound consequences. Given the existence of the Aether, 
its rest frame would be expected to be be privileged by Maxwell's equations, so the lack of 
Galilean invariance was not perceived as an immediate impasse. This led to the proposal 
that as electromagnetism is not Galileo- invariant, experiments involving electromagnetism 
could then be used to determine motion with respect to the Aether rest frame. There were 
furthermore speculation that this Aether rest-frame might coincide with absolute space. 

However, in the Michelson-Morley experiment a null result'' was obtained for the ve- 
locity of the Earth relative to the Aether. Furthermore, within the framework of Aether 
theory, this was in contradiction with Bradley's observation of stellar aberration. Although 
Fitzgerald and Lorentz |257j attempted to explain these observations constructively in terms 
of somehow the inter-particle distances of particles travelling parallel to the aether flow 
being contracted, Einstein had a different, axiomatic strategy akin |124| I128j to how ther- 
modynamics is based on the non-existence of perpetual motion machines. He elevated the 
outcome of the Michelson-Morley experiment from a null result about motion and elec- 
tromagnetism to a universal postulate. Rather than there being Galilean invariance for 
mechanics, Lorentz invariance for electromagnetism and goodness knows what invariance 
for other branches of physics, he postulated that 

RPl (relativity principle) all inertial frames are equivalent for the formulation of all physical 
laws. 

From this it follows that the laws of nature share a universal transformation group 
under which they are invariant. There is then the issue of which transformation group 
this should be. RPl narrows this down to two obvious physical possibilities, distinguished 
by whether the laws of nature contain a finite or infinite propagation speed Vprop-^ If one 
adopts absolute time as a second postulate (Galilean RP2), the infinite is selected, and 
one has universally Galileo-invariant physics. The finite is selected if one adopts instead a 
constant velocity postulate such as 

Lorentzian RP2: light signals in vacuo are propagated rectilinearly with the same velocity 
at all times, in all directions, in all inertial frames. 

The chosen velocity serves universally [and so is unique, so taking Uprop = (the speed of 
light c) is without loss of generality]. One has then a universally Lorentz-invariant physics. 
In the former case, which amounts to upholding 'N4', electromagnetism must be corrected, 
whereas in the latter case Newtonian mechanics must be corrected. Einstein chose the 
latter. Notice that this is the option given by a law of nature and not some postulated 
absolute structure; also whereas there was ample experimental evidence for Maxwellian 
electromagnetism, existing experimental evidence for Newtonian mechanics was confined to 
the low velocity (u<Cc) regime for which Galilean transformations are an excellent approx- 
imation to Lorentz transformations. Indeed the investigation of the high velocity regime 
promptly verified Einstein's corrections to Newtonian mechanics. This example of the great 
predictive power of special relativity is compounded by the universality: for each branch of 
physics, one obtains specific corrections by requiring the corresponding laws to be Lorentz- 
invariant. The concept of non-materially substantiated media and the above proposal were 
thus destroyed, and physics was rebuilt on the premise that there was no room in any of its 
branches for analogous concepts and proposals. 

Minkowski pointed out that whilst Newton's notions of absolute space and time are 
also destroyed because there are no longer privileged surfaces of simultaneity, one could 
geometrize space and time together as spacetime, in which the null cones are privileged. 
These correspond to the surfaces on which the free motion of light occurs (and of all other 
massless particles, by Einstein's postulates: one has a universal null cone structure in clas- 

'^Today this is known to be null to 1 part in 10^^ )fci2l . 

*If one wishes furthermore to avoid causality paradoxes one chooses this to ba a maximum speed (rather 
than a minimun one for tachyons). 



sical physics). And massive particles are permitted only to travel from an event (spacetime 
point) into the interior of the future null cone of that event. Of particular significance, in 
free 'inertial' motion all massive particles follow timelike straight lines whereas all massless 
particles follow null straight lines. Following from such a geometrization, it makes sense to 
implement the laws of physics in terms of the 4-tensors corresponding to Minkowski's 4-d 
spacetime. 

First, reconsider electromagnetism. Whereas in this special case the main laws (Maxwell's 
equations) are already Lorentz-invariant and thus require no corrections, they can be cast 
in an elegant spacetime notation, and the new conceptual framework greatly facilitates the 
study of electrodynamics jl2Uj . Introducing the electromagnetic field tensor F^p 

(s.t F^o = ^a, Fap = ^apcB'^) the Maxwell equations are djF^^ = —fJ.oie 

(71) 

dlAFBc] = (72) 



If one uses an electromagnetic ^-potential A = [—<!>, A] s.t = 29[^A^] , 

(73) 

then (f7^ holds trivially and one is left with UA^ —d^dj^K^ = — ^oJ^ 

(74) 

where = [pe^j^]- Naively A^ would have 4 d.o.f's. But there is the constraint (|62|) : also 
(not unrelatedly, it turns out) H73() is clearly invariant under the gauge transformations 

k^^k^ + d^K (75) 

for any function A. Thus electromagnetism has 2 d.o.f's per space point, corresponding to 
light having 2 polarizations. For later use, the energy-momentum tensor of electromag- 
netism is^ 

t^^,^ = j^if^^f^c - {ri^^'P ° F) (76) 

which is symmetric and is conserved. The Lorentz force law (|Hf)j) becomes 

cPx^ e dx^ 
dr^ m ^ dr 

where r is the time measured in the particle's rest-frame (the proper time). 

Next it was required to change the forms of all the laws of nature. ^'^ For Newtonian 
Mechanics, N2 and the definition of momentum are still correct, provided that proper time 
is employed. The relativistic laws of nature are a great success. Indeed in many applications 
an important step toward proposing new laws of physics is to consider only the Lorentz- 
invariant possibilities. However, Einstein found that attempting to accommodate gravity 
in this scheme presented significant difficulties. 

1.5 General relativity 

Nearby freely-falling particles in a (non-uniform) gravitational field experience a relative 
acceleration. This leads to the need to replace the inertial frames of Newtonian mechanics 
(which are supposedly of infinite extent) by local inertial frames. In order to be able to 
define these it is crucial that inertial mass be identically proportional to gravitational mass 

use ^ o _B to denote the trace of the matrix product of A and B. 
^''Since this occurred in 1905, we mean all the other classical laws of nature known at that time. 



(77) 



for all materials, for else each material would require its own definition of local inertial 
frame. This is the principle of equivalence (POE). Einstein jl21j then adopted the some- 
what stronger supposition^^ that gravitation is not locally distinguishable from acceleration 
by physical experiments anywhere in the universe, and can thus be transformed away by 
passing to the suitable local inertial frame. He then guessed^^ that the inertial frames of 
SR were to be identified with the local inertial frames of freely- falling particles. To Einstein 
the POE strongly suggested |13nj that gravitation could be included within relativity by 
boldly postulating that spacetime with gravitation would not be flat Minkowski spacetime 
but rather a spacetime curved by the sources of gravitation so that the straight timelike 
lines followed by free particles in Minkowski spacetime are bent into the curves followed by 
relatively- accelerated freely- falling particles. The straight null lines constituting the light- 
cones of Minkowski spacetime would then likewise be bent by the sources of gravitation. 

The mathematics of the connection permits the incorporation of the above features of 
the gravitational field. The coordinates in which the connection may be set to zero at each 
particular point are to correspond to the freely- falling frame at that point. The privileged 
curves followed by freely falling particles and by light rays are to be the timelike and null 
affine geodesies of the geometry; at any point in the freely-falling frame these reduce to 
the straight lines of Minkowski spacetime. ^'^ The geodesic equation (|12|) is of the form of 
the combination ()37|) of N2 and Newton's law of gravitation for a connection whose only 
nonzero components are Fqq = di(p; from this it follows that the only nonzero Riemann 
tensor components are 

R'oio = didj^ (78) 

so that one obtains agreement between the Newtonian tidal equation H38|) and the cor- 
responding case of the geodesic deviation equation (|T3|) . In GR the geodesic deviation 
equation plays an analogous role to that of the Lorentz force law H77|) in electrodynamics 

Furthermore, Einstein introduced a semi-Riemannian metric qab on spacetime, both 
to account for observers in spacetime having the ability to measure lengths and times if 
equipped with standard rods and clocks (paralleling the development of SR), and further- 
more to geometrize the gravitational field. For simplicity, he assumed a symmetric metric 
and it turned out that the aforementioned connection was the metric one jl22j . As g^B 
reduces locally to SR's rj^§ everywhere locally the laws of physics take their SR form. 

This is not yet a gravitational theory: field equations remain to be found. Einstein |12ISj 
'derived' his field equations (EFE's)^^ 

T=. 1 T=, 87tG\ -Matter 

Gab = Rab - -5abR = I I T^^ (79) 

by demanding 

GRP (the General Relativity Principle) that all frames are equivalent embodied in space- 
time general covariance (the field equations are to be a 4-tensor equation). 
GR Newtonian Limit that the correct Newtonian limit be recovered in situations with 
low velocities u <C c and weak gravitational fields (p <^ c^. Note that by (|78|) Poisson's 

equation of Newtonian gravity (|39|) may now be written as Rqo = ^T^Gp 

(80) 

which is suggestive that some curvature term should be equated to the energy-momentum 
causing the gravitation. 

Tests distinguishing between the various forms of the principle of equivalence have been devised and 
used to experimentally confirm each of these forms to high accuracy |379l . 

^■^This guess is now experimentally supported |34tj) by Pound-Rebka type experiments |379l to 2 parts in 
10-^ 

^■^Thus this implementation of the POE tacitly includes the signature assumption contained in 
Lorentzian RP2. 

^"^Tab is the curved spacetime energy-momentum tensor. 



GR divergencelessness since Tab is conserved (divergenceless: V^T 
metric, this curvature term should also have these properties. 
Thus, from the end of 0.3, Gab is a good choice of curvature term. 



-AB 



0) and sym- 



It is also helpful that both the EFEs' Einstein tensor and the metric which solves them 
are (0, 2) symmetric tensors, so that the EFE's are neither over- nor under-determined. 
In principle if one attempted other geometrizations of the gravitational field, one would 
usually face mismatches rather than the above coincidences |328l . 

I leave the disputed role of Mach's principle |265j (about how to abolish absolute space) 
both in the above conception of GR and within GR itself to 1.2.7.3 and II. 2. 

The above considerations are all physical. But in fact the following mathematical GR 
Cartan simplicities are also required to axiomatize GR: that Gab contains at most 
second-order derivatives and is linear in these. The GR Lovelock simplicities |259j 
eliminated the linearity assumption in dimension n < 4. One should note that throughout 
AgAB is an acceptable second term on the left hand side by all these considerations. Such 
a A is a cosmological constant which is thus a theoretically-optional feature, the need for 
which is rather an issue of fitting cosmological observations. 

The credibility of GR was rapidly established by its precise explanation of the perihelion 
shift of Mercury and experimental verification of the bending of lightrays by the sun. Today 
such solar system tests (also including the time-delay effect) show no significant deviation 
from GR to 1 part in 10^ j379j . Binary pulsar data are also in very good agreement with 
GR. Note however that all these tests are of the weak-field regime of GR. There may soon 
be tests of a more strong-field regime from gravity wave experiments (see 1.2.11). 

GR however may be interpreted as having theoretical impasses. It predicts its own 
inapplicability in extreme circumstances (in black holes, and during a tiny time interval 
after a cosmological Big Bang) that are moreover likely to occur in our universe (by the 
Hawking-Penrose singularity theorems jl77l I363j ) . Perhaps relatedly, one does not know 
how to combine GR and QM to form a theory of quantum gravity necessary for the study 
of these extreme regimes (see 1.3). 

1.6 Many routes to relativity 

There are a number of other routes to obtaining GR. It is important to have as many routes 
as possible to a physical theory since a number of them will allow new insights into how the 
theory works or be particularly adapted to the solution of otherwise intractable problems. 
Furthermore, different routes may suggest different alternative or more general theories for 
the current theory to be tested against. 

Wheeler listed 6 such routes to GR in 1973 |273j . The first is Einstein's above. The 
second is Hilbert's, from variation of the Einstein-Hilbert action |37Uj 



A simplicity proof equivalent to Cartan's but for actions was given by Weyl |37Uj . This 
derivation of the EFE's is very straightforward (see 1.2.4), which helped to bring variational 
principles to the attention of the physics community. 

The third and fourth routes are the two-way workings between (|81|) and the Arnowitt- 
Deser-Misner (ADM) action, which (in vacuo for simplicity) takes the form (19| 




(81) 





(82) 



Hi 



2Djp,^ = 



(84) 



(up to a divergence term), which follow from the 'ADM' split of the spacetime metric with 
respect to a sequence of spatial slices, according to 

Here, hij is the metric induced on the spatial slice (see II. 2.1 and fig 4) with determinant /i, 
the lapse a is the change in proper time as one moves off the spatial surface and the shift 
Pi is is the displacement in identification of the spatial coordinates between two adjacent 
slices. The dot is the derivative in the (time) direction perpendicular to the slice, 
Conventionally, hij, f3i and a are regarded as canonical coordinates, p^^ is the momentum 
conjugate to hij and there is no momentum associated with Pi nor a: these are Lagrange 
multipliers. Thus the true gravitational d.o.f's in GR are contained in 

Riem = {space of Riemannian 3-metrics} (86) 

on a fixed topology, which I usually take to be compact without boundary (CWB). But 
the true d.o.f's are furthermore subjected to the Hamiltonian and momentum constraints 
H and Tii respectively. 

gAB 

Figure 4: 

The 3-space approach (TSA) of this thesis is mainly concerned with the third and fourth 
routes, particularly the extension of the derivation of ADM's system of field equations 
without starting from any spacetime formulation of GR. For completeness, the fifth and 
sixth routes mentioned are the Fierz-Pauli spin-2 field in an unobservable flat background 
and Sakharov's idea that gravitation is an effective elasticity of space that arises from 
particle physics |322j . 

One could add a number of (mostly) more recent routes to Wheeler's list. Three el- 
ements are common to many new routes: changes of variables, splits and discretizations. 
Changes of variables include the use of first-order formulations such as those of Cartan [7S] 
(credited by Wheeler as a route in his earlier account |376j ) or Palatini j273j . as well as 
the use of various sorts of spinors, twistors and the Newman-Penrose formalism j346[l29!2] . 
Among these the Ashtekar variable formulations (see 1.2.12.3) stand out for allowing new 
quantization possibilities. At least some of these formulations can be cast in terms of space- 
time or in terms of a foliation by spatial hypersurfaces. There are also other interpretations 
of the standard split such as the thin sandwich (1.2.9), and indeed other splits (see 1.2.12.4). 
Discretizations can also be spacetime-based (e.g Regge calculus j273j ^ or space-based (dy- 
namical triangulation pS])- These might be taken as approximations for numerics, but that 
logic can be reversed to suggest that these are the true formulations and that GR is but a 
macroscopic limit. 

Other elements to new routes are more bizarre - first principles or apparently-unrelated 
mathematics from which GR somehow drops out. Examples include obtaining GR from 
the algebra of deformations of a spatial hypersurface (II. 1, VI), obtaining GR from rela- 
tional 3-space principles in the TSA, and routes obtaining GR as an effective theory such as 
Sakharov's route above, or Jacobson's j2()7j (in which GR is deduced from black hole ther- 
modynamics). When QM considerations are included, even more colourful routes emerge: 
in particular the EFE's arise from the closed string spectrum (see 1.3.3.4). There are also 
as-yet incomplete attempts to recover spacetime (see the reviews (7^1 12(J2| ) . 




Among all these routes, I distinguish those to GR alone, those to GR with matter fields 
'added on', and those which attempt to be more: unifications of GR with other branches of 
physics. I emphasize the need to be comprehensive in including a full enough set of fields 
to satisfactorily describe all of nature to lend good credibility to the principles behind the 
route. Among the unified theories, I distinguish between total and partial attempts at uni- 
fication and between the classical and those which furthermore include QM. For example, 
already-unified Rainich-Misner-Wheeler theory (1.2.7.2), Weyl's theory (App III. A) and 
Kaluza-Klein theory (1.3.3.4) are attempted classical unifications of clcctromagnctism and 
gravity. The five superstring theories [or an as-yet undetermined, all-embracing M-theory] 
(1.3.3) are Theories of Everything - attempts at total unification of all classical and quan- 
tum physics. Credible unification should unexpectedly include extra physics (like light in 
electromagnctism) and (or) lead to verified predictions. In the absence of this one should 
apply Occam's razor. 

Finally, some attempted routes or the dropping of some simplicity postulates suggest 
modifications to GR. This is extra structure for its own sake rather than geared to wards 
the inclusion of other branches of physics. Relevant well-known alternative theories include 
higher derivative gravities. Lovelock gravity (see p 65) and dilatonically-coupled scalar- 
tensor- type theories such as Brans-Dicke theory (see III. 1.3). One can think of these theories 
as providing alternatives against which to test GR in certain regimes. Brans Dickc theory 
provided a theory to test GR against in the solar system. I worry higher-derivative theories 
might alter both the physics and the numerical modelling of compact binaries. 

1.7 Other classical matter fields 

There is much treatment of matter in the thesis so as to upgrade the TSA route to relativity 
alone to being a route with matter 'added on'. So I wish to couple a full set of matter fields 
capable of describing nature. I consider the fundamental fields, which are important since 
their flat spacetime versions account for all accepted physics bar gravity. Consequently these 
are valued as theoretical schemes, so I also hope to learn from the classical and quantum 
theory of these flat-spacetime theories so as to better understand the structure of GR (see 
below, 1.2.3.1, 1.3.2, App II. A). For the moment I present the usual fundamental fields in 
flat spacetime below, establishing much notation. The curved versions coupled to GR are 
introduced later (in 1.2.5-6 for standard formulations, and in IV and VI for the TSA). Other 
(in some cases decidedly nastier) fields come into play to wards the end of the thesis (VI- VII) 
as theoretical possibilities to help sharpen the TSA, and to see to which extent its workings 
were genuinely selective rather than selective by unwarranted simplicity assumptions. Here 
the possibilities that the TSA "hints at partial unification" or might have some control over 
the equivalence principle are investigated. The below treatment includes not just the usual 
Lagrangian formulation but also the Hamiltonian formulation because this thesis is a 3-1-1 
space-time split study, geared toward canonical quantization. 

1.7.1 Scalars and spin-^ fermions 

The Klein-Gordon theory of a scalar field ? with mass has the Lagrangian 

Lkg = -\{dA^d\ + m^) = ^{e-d,.d,- my) . (87) 

^^Furthermore, some routes do not quite lead to standard GR (e.g one might get EucUdean GR or 10-d 
GR, or a dilatonic rather than minimal coupling which could be thought of as interfering with the geodesic 
postulate part of GR) . 

^^The fundamental fields are those it makes sense to quantize and in terms of which all other matter fields 
may in principle be described. Of course this is to some extent a matter of taste: e.g particle physics may 
change in what is regarded as fundamental or GR might turn out not to be fundamental. 



The conjugate momentum is 



= ^ - • 



(88) 



and the Hamiltonian is H^g = ■^iT^'^+d<i-dq+m^<i^) . 

(89) 

The corresponding ELE is the Klein-Gordon equation Bc^+ni^q = , 

(90) 

which is a flat-space relativistic (i.e Lorentz-invariant) wave equation. For = 0, it is the 
flat-space wave equation. 

For use below, the Klein-Gordon theory for two scalar fields and c^2 may be re- 
expressed as the Klein-Gordon theory for a single complex scalar field, with Lagrangian 

Lrg = -dA'id^* + m1C<; (91) 
for ? = ;^(?i + ?2)- There is no reason to believe that Klein-Gordon theory is realized 
as a fundamental theory in nature. I use it as a toy (in 1.2.3.1 and 1.3), and note that its 
classical canonical structure is unaffected if one uses a potential of form F(<;) in place of 
m^q^. To date, fundamental scalars play only hypothetical roles in physics. Higgs scalars 
are postulated to give masses to certain fields, and infiaton(s) are postulated to explain 
certain features of the early universe. More complicated scalars also occur in scalar-tensor 
theory (III. 1.3). 

Dirac investigated whether the Klein-Gordon operator might be the square of some 
linear operator. Indeed, he thus arrived at the linear Dirac equation^^ 

{il^pdA-m^5p'')r = ^ (92) 



for the Dirac matrices obeying the Dirac algebra 7'^7^-|-7'^7^ = 2r]^^ 



(93) 



I work in the chiral representation in which the 4-spinor 



(94) 



where D and L stand for right- and left-handed, and 7*^ 



are the chiral representation Dirac matrices, and 



1 

1 
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-i 

1 
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-1 



are the Pauli matrices. 
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(95) 



(96) 



The Dirac equation is then i(— (?o + o'"5a)V'D = rn^il^i, , i(— (Jg ~ cr"'da)ijjL = n^vV'D ■ 

(97) 



^^I use lower-case greek indices running over 1 to 4 for 4-component spinor indices, and typewriter capital 
indices running over 1 to 2 for 2-component spinor indices. I often suppress these spinorial indices. 



Clearly if = 0, this decouples, into two 2-component Weyl equations: 

i{-d-o + a^d-a)i^B = , i{-d-o - a^d-a)i^L = . (98) 

The Dirac equation gives a highly successful theory of the electron (and of the predicted 
and successfully observed positron), whereas the L-Weyl equation gives a theory of the 
neutrino. Note that I am treating these things classically here. They are usually treated 
quantum-mechanically. 

I next provide 2 forms of Dirac theory Lagrangian. Defining -0 = ip^j^, 



■-Dirac ~ ^ 



(9L 

The conjugate momentum of Dirac theory is curious: vr^ = — r = —iip^ . 



(100) 



The Dirac Hamiltonian is then Hoirac = —itp^'y-dip+rmljip (= iip^ijj) . 

(101) 

Finally, Yukawa theory (which may be regarded as a toy, a failed theory of the nuclear 
forces or how the electron may have gained its mass from a Higgs scalar) follows from 

Lttwa = + L^'^ - g^V^^, . (102) 
Note that the interaction term does not disrupt the classical canonical structure. 
1.7.2 Electromagnetism, U(l) scalar gauge theory and QED 

Pure electromagnetism (no sources) follows from the Lagrangian 

Li = -ip o F = -i[(A + d<^f - B'] . (103) 



dl ■- - 

The conjugate momentum is tt°' = — — = —A'^—d"'^ = E°- 

dAa 



The Hamiltonian is then Hem = -^(vr^+i?^)+<I>9-7r 



(104) 



(105) 



up to a total divergence. The last term has the form of an appended constraint. 

A common gauge choice is the Lorenz gauge d^A^ = so ()74(1 in vacuo =^ UA^ = 0, 

(106) 

I now consider coupling the theories of the previous section to the electromagnetic field. 
The usual treatment is to consider a complex scalar field, note that its Lagrangian is 

globally U(l) symmetric, i.e invariant under — > e^^^(, , A = const , 

(107) 

and demand that the symmetry also holds locally i.e for A = A(x). Then one finds that Oa'^ 
would contain an extra unwanted term. Just as on p 8, one introduces a new term whose 



transformation properties compensate for the unwanted term. This is gauging, a procedure 
originally due to Weyl |372j . Again the introduced term is a connection, whereby one 

forms a covariant derivative VJ? = d^q+ieA^q , V^?* = d^c^* —ieA^<;* , 

(108) 

Then in place of the Lagrangian of complex Klein-Gordon theory, one arrives instead at 

Lua) gauged = -V1?V^^,* + m,,,* . (109) 

One then requires to be a dynamical field. Standardly, this is taken to be the field 
uniquely selected by Lorentz-, gauge- and parity- invariance plus the flat-space QFT restric- 
tion on higher-order interaction terms (explained in 1.3). This is of the same form as (and 
is furthermore identified with) the electromagnetic potential. So the resulting U(l) scalar 

gauge theory Lagrangian is L^(^)^ = L^J^^ gauged+^L • 

(110) 



For spin-2 fermions, there is a global U(l) invariance under ip — > e^^ if) , A = const . 

(Ill) 

Gauging this, the Dirac Lagrangian may be replaced by 

~^vt) gauged = ^V^T^n^ - ^^^'^ = \ (V^T^nV' - - m^i''^ . (112) 

Adjoining this to the electromagnetic Lagrangian to make A^ into a dynamical field, one 
has the classical theory whose QM counterpart is quantum electrodynamics (QED): 

■-QED — '-u(l) gauge + "-em • \^^'^) 

Note that the classical canonical structure of the free theories does not significantly 
differ from that of the coupled, interacting theories. Also, in the above theories the elec- 
tromagnetic vacuum laws are replaced by non-vacuum laws with fundamental source terms 

3f = iec;V'^^C , j^ = ei^-f^^, (114) 

where the big double arrow indicates the derivative acting to the right minus the derivative 
acting to the left. 

QED is regarded as a highly successful theory: it predicted the currently-observed 
anomalous values of the magnetic moment and charge radius of the electron, as well as 
the small Lamb and Uehling shifts in spectral lines j364j . 

1.7.3 Yang Mills theory, QCD and Weinberg Salam theory 

There remains the issue of the powerful but apparently short-range nuclear forces. The 
Yang-Mills idea was to consider gauging with respect to a larger gauge group G (they 
considered 

SU(2) |387j . but it was soon generalized j360j to a large class^^ of gauge groups^^). 
^*This class was precisely determined by Gell-Mann and Glashow, see App IV. B. 

^^The gauge groups considered are internal, as opposed to the space(time) groups of 1.0.1. All these groups 
are Lie groups, i.e simultaneously groups, topological spaces and analytic manifolds. Near the identity they 
may be studied via the corresponding Lie algebra G. This is an algebra with a product |[ , ]| : G x G — > G 
that is bilinear, antisymmetric and obeys the Jacobi identity 

\[9i, \[92,93]\]\ + \[92, \[g3,9i]\]\ + |[ff3, |[5i,32]|]| = (115) 
V 9i, 92, 9i (z G. The r are generators. I usually use [i]-indices over these, but in the particular case of 



Consider a multiplet of scalar fields ^ with arbitrary potential V{<^) with a Lagrangian 
invariant under the global symmetry ^ — > g'WA-r ^ ^ const 

(117) 

and demand that the symmetry also holds locally i.e for Qa. = QAix)- Then one finds that 
5/1? would contain an extra unwanted term. Thus one requires to include a compensatory 

connection obeying ^a'^a — gA^TA9~^ -{0^9)9'^ , 

(118) 



thus leading to the formation of a covariant derivative V^<j = d^<;+ig^A^TA'i ■ 

(119) 

Thus one has passed from the "j-Lagrangian to a G-gauged <^-Lagrangian. 

Lg gauged = - E V^?V^\ + V^(0- (120) 
?-multiplet 

One may similarly pass from the Dirac Lagrangian to 

L&tuged = ^7^V^V - m^^^ = ^ (V^T^V^V - (V^Vi)7^^) - m^^i^ . (121) 

One then requires ^4^ to be a dynamical field. Again, by Lorcntz-, gauge-, parity 
invariance and the QM argument which cuts down on higher-order interactions, one arrives 
at the pure Yang-Mills theory Lagrangian^^ 

LiM(G)=-^Fi°F' (122) 

where the Yang-Mills field tensor is Fj^^ = d^A^^ — d^Ai^ + g^\[A^, A^]\i. is the 
coupling constant of the theory. 

The pure Yang-Mills field equations are 

^G^GIA^B] ^ Q . ^ g_glA^B] ^ _gJ^^^A\F^AB ^ (123) 

so one can see this as a bigger, self-sourcing and hence nonlinear version of clcctromagnctism, 
corresponding to messenger particles which, unlike the photon, are themselves carriers of 
the theory's charge. The 3-1-1 split is 

L3|i(YM(G)) = -^i^IoFI+^(aoAI,-5,^Io+l[^s,^o]|I)(5°^''-5'^'°+|[^^^°]|') , (124) 



Qj = D-aA'^j = d-aA'^j - gJijK^s TTi" = , (125) 
5G5G[M? = ^(4-5^$i). (126) 



internal gauge groups, I use bold Capital indices. The Q's are auxiliaries parameterizing the amount of each 
generator. |[ta,tb]| = if^ABi where /*^ab the structure constants. From the properties of |[ , ]|, these 
obey 

/ijK = /i[jK] (antisymmetry) , 
/'jk/ilm + /'jm/ikl + /Sl/imk = (Jacobi identity) . (116) 



This is self interacting theory of spin 1 bosons alone rather than Yang-Mills gauge theory, taken to mean 
Yang-Mills-'scalar and (or) Dirac' theory. 



The conjugate momenta are 



(127) 



and the Yang-Mills Hamiltonian is HyM = t^iT^I+^FioF^+AqGj . 

(128) 



une tnen nas Lq — "-g gauged +'-ym(G) 



(129) 



rip,ip,Ai _ rV':'0 , rAi nQn^ 
■-G — "-G-gauged + '-YM(G) • K^'^^) 

In these, additionally to the self-sourcing in (|12.S)) . there are source currents similar to those 
of the previous subsection. 

Yang-Mills gauge theory was originally ridiculed as a theory for the nuclear forces, for 
surely such short range forces would be associated with massive messengers. However, by a 
distinct argument in each case, this difficulty has been overcome to form successful theories 
for the weak and strong nuclear forces. 

For the weak force, the idea is Higgs' local spontaneous symmetry breaking |18,S| K-iHlj . 
The messenger bosons may indeed have mass, since they may acquire it from as-yet experi- 
mentally hypothetical Higgs scalar fields. This study requires the Yang-Mills-scalar gauge 
theory. 

The weak nuclear force accounts for observations of parity violation. The accepted 
theory for this force is contained within the Weinberg-Salam unified theory of electroweak 
interactions, which is based on the spontaneously-broken SU(2) x U(l) Yang-Mills theory. 
The theory's Lagrangian is a complicated combination of Yang-Mills and appropriately 
gauge-corrected Dirac and Weyl pieces together with multiplets of Higgs scalars to break 
the symmetry, in which the left and right multiplets are treated differently. This is the 
theory of 3 messenger bosons (W^"*", W~ and Z^) in addition to the photon, and of 
electrons, positrons and neutrinos. Weinberg-Salam theory explained a number of nuclear 
reactions, and correctly predicted the existence of (and masses for) W~^, W~ and bosons 
and effects due to the neutral currents associated with Z^. 

The strong force is not accounted for by spontaneous symmetry breaking, but rather 
by the idea of confinement: that the coupling constant rises over long distances. There 
is evidence for the strong force having 3 charge types from scattering cross-sections. This 
led to postulating an SU(3) theory,^^ quantum chromodynamics (QCD), which in turn led 
to the eightfold way^^ 'periodic table' of nuclear physics, a pattern partly explaining the 
known spectrum of hadrons and partly correctly predicting hadrons not yet known at the 
time. The charge is called colour, the messengers are the 8 coloured gluons, and the hadrons 
are composed of coloured, fractionally-electromagnetically-charged particles called quarks. 
Confinement, which follows from a theoretical property of QCD called asymptotic freedom, 
accounts for quarks and gluons not being observed. But jets in high-energy collisions are 
regarded as indirect evidence for quarks. QCD also explains some low-energy physics of 
pions and nucleons [but this has the U(l) Problem, see IV. 1.3. 3]. 

The Standard Model (SM) of particle physics is the SU(3) x SU(2) x U(l) collection 
of the above, repeated 3 times over [due to the known particles of nature curiously and 
unexplainedly apppearing to belong to 3 increasingly more massive copies (generations) of 
an otherwise identical set of particles] . 

■^^SU(n) has dimension n and - 1 independent generators. 



2 On the split formulation of Einstein's field equations 



2.1 Geometry of Hypersurfaces 
2.1.1 Extrinsic curvature 

Consider 2 nearby points pi and P2 lying on a curve in K^. The arelength between these is 
As = ^/Ax^~+~Ay^. Let the tangents at pi and p2 be ti and t2 and the angle between them 



lim Atl^ 

be Atb. Then another type of curvature at pi is given by k = „ — 

^ "^^ ^ As — >■ As 



pi 



dtp 
ds 



Note that this can also be interpreted as the rate of change of the normal, as used instead 
in the modern generalized definition below. Next, if the curve is in K"', then near each 
pi the curve lies within a plane. Then use the above notion of curvature. This notion of 
curvature is extrinsic, i.e referring to how the curve is bent relative to its surrounding 3ft^ 
exterior. To generalize this notion of extrinsic curvature from curves to 2-sur faces within 
Euclidean 3-space draw each plane through the point p in turn. In each plane the 
2-surface traces a curve. Apply the 2-d notion above to determine the curvature. The 
maximum and minimum values thus found are the principal curvatures ki and K2- Then 
one has the following measures of how the surface is bent within 3?^:^^ 

(mean curvature) = ki + K2 , (131) 



(Gauss curvature) = K1K2 ■ (132) 

2.1.2 Gauss' outstanding theorem 

Despite being defined in very different ways, it turns out that the intrinsic and extrinsic 

notions of curvature of a surface are related. For a 2-surface embedded in 3?^, the intrinsic 
curvature is in fact equal to (in the above convention twice) the Gauss curvature: 

R = 2kiK2, (133) 

which is Gauss' outstanding theorem. 

2.1.3 Hypersurface geometry 

I now generalize the above result. First, allow the embedding space itself to also be curved. 
Then the result is in fact about a projection of = Rabcd, the flat-space Riemann tensor. 
Thus one is to set up a theory of projections. Although for the above one can get away with 
less (the projections of = Rab suffice for 3-d), the second generalization is to arbitrary-d 
with the role of the surfaces being played by hypersurfaces of codimension 1, which generally 
requires the projections of the full Riemann tensor. The third generalization is to permit 
the embedded and embedding spaces to be of arbitrary signature. I denote a manifold with 
r independent spatial directions and s independent timelike directions as an (r, s) space (of 
dimension n = r + s and 'signature' s) and denote the embedding itself by (r, s; e) where 
the 'added dimension' /Lt is spacelike for e = 1 and timelike for e = —1. 

To set up the theory of projections, one takes the (n + l)-d metric gen and expands 
it in terms of a basis consisting of the normal to the hypersurface ua and the n projectors 
e^A- 

gCD = hcD + encno , (134) 

■^^Thc overall numerical factors in these definitions are unimportant. The mean curvature is often defined 
as a true mean i.e one half of the above. The forms given are the sum and product of the eigenvalues i.e the 
trace and determinant of the extrinsic curvature matrix. 



where the first term is the metric induced by gcD on the hypersurface, hcD = 9cd^'^c^'^D- I 
am interested in the study of projections onto the hypersurface and onto the normal. I use 
0...a... = 0,,,A...^^a to denote projections onto the (r, s) hypersurface v, and 
0...±... = d,,,A...n'^ to denote projections onto the normal. Then for the metric, 



g±± = e , 5a± = . 
In general, the higher-d metric is split according to 



(135) 



9CD 



, SO that g' 



CD 



(136) 




The extrinsic curvature is Kab = —^anb = S^hab , 

(137) 

which is a measure in terms of the rate of change of the normals of how much the hyper- 
surface is bent in the surrounding spacetime.^^ The second form of (|137|) shows that the 
extrinsic curvature is manifestly symmetric. Its other notable property is that it is both a 
n- and (n+l)-d tensor since Ka± = 0. 

The three projections of the Riemann tensor (see e.g jl31l l23L)j ) are respectively the 
Gauss equation, the Codazzi equation and the Ricci equation: 

Rabcd = Rabcd — '^^Ka[cKd]b , (138) 



R±abc = -^eD^cK^a , (139) 

R±a±b = -i^^Kab - eDbDaa) + Ka'K,b . (140) 
From these it follows by one contraction that 

Rbd - eR±b±d = Rbd - e{KKbd - Kb'K^) , (141) 

Ga± = Ra± = -e{DbK\ - DaK) , (142) 
65K - eD^a 

R±± = — KoK, (143) 

a 

and the Gauss equation can be contracted a second time to give 

2G±± = 2R±±-eR = -eR + K^-KoK , (144) 

where I exhibit the useful relations to the projections of the Einstein tensor Gab- 

^■^The hypersurface derivative is = — £0. Note that the correction to ^^jf- is fphab ~ 2-D(a/3i,) = 
(jK/3)ab for |K the Killing form. 



(|144jl is indeed Gauss' outstanding theorem for a 2-surface embedded in flat 3-space: 



R = K"^ - K o K = i ^ ka\ - ^ = kI + 2kiK2 + - (k^ + k^) = 2kiK2 . 

^A=l,2 / A=l,2 

The following geometrical identities are useful for GR. (|14()j) and (|141j) ^ 

-{SgKab - eDbDaa) - KKab + 2Ka'Kbc + eRab = eRab , (145) 
which has the trace [a linear combination of H144() and (|143() ] 

-{5-nK-eD'^a)-K'^ + eR = eR-R^^. (146) 
a 

___ 6^K - eD'^a K o K + K"^ R 

Another linear combination of (|144l) and (|143() is — = e — . 

a 2 2 



From this and (|145j) it follows that — e{5^Kab — hab^pK) — D^Daa + habD^a 



+ e { 2K/Kb, - KKab + ^°\'^^^ hab ]+Gab = Gab (147) 



which forms the last projection of Gab- 

The above equations (|138m4j|) are the starting point for embedding theorems with 
codimension 1. If one sets all the higher-d curvature tensors to be zero in (|138H14(7|) one has 
the equations determining embeddability into flat manifolds. If the Ricci curvature is set to 
zero in H141H143|) one has the equations determining embeddability into Ricci-flat (vacuum) 
manifolds. Locally, whereas the former is not guaranteed, the latter is (assuming analyticity: 
the Campbell-Magaard result, see 1.2.3 and B.l). The higher codimension generalization 
permits all spaces to be locally embeddable in higher-d flat manifolds. Global results involve 
much harder mathematics and much larger lower bounds on the required codimension jl65) . 



2.2 Split of the EFE's with respect to nowhere-null hypersurfaces 

Consider slicing some region of spacetime into a sequence of hypersurfaces of codimension 
1. The working is for general (r, s; e). The hypersurfaces are held to be everywhere of fixed 
signature and are not allowed to be null. Whereas my main interest is in the 
(3, 0; -1) case i.e in foliating the usual 4-d spacetime with a sequence of 3-d spatial slices, 
I will also investigate the crucial role of these particular values of s and e (the overall 
dimension itself is of little consequence). Also, the application to thin matter sheets (Part 
B) has been conventionally studied as the (2, 1; 1) case in the ordinary GR or as the 
(3, 1; 1) case in GR-based braneworld scenarios. The framework of this section is broad 
enough to be of use for all these applications. One should also note that whereas one can 
perform an infinity of different slicings^^ in GR, particular slices and slicings are useful 
in numerical relativity (see 1.2.9-11) and privileged slices and slicings occur both in some 
alternative theories in II. 2 and as the thin matter sheets themselves in Part B. 

I use the arbitrary (r, s; e) "ADM-type" split H136p ^ of the higher-d metric qcd (|136|) 
with respect to the (r, s) 'initial' hypersurface Tq. The following restrictions are associated 
with such splits. One requires the foliating hypersurfaces to be of a fixed topology. One may 
require further restrictions on the spacetime to ensure good causal behaviour. In the (n, 0; 

■^■^By a slicing I mean a sequence of non-intersecting hypersurfaces that cover some spacetime region. This 
is also known as a foliation. GR happens to be a foliation-invariant theory as explained in (1.2.6). 

^^I use unnecessarily doubled quotation marks to cover either general-signature or specifically nonstandard- 
signature analogues of concepts usually associated with a standard signature. 



-1) case, note that the first of these prevents consideration of topology change, while the 
second takes the form that the spacetime is to be globally hyperbolic (1771 1363] although 
sometimes one can get away with working locally instead. 

In the (re, 0; -1) case, the foliation is to be interpreted in terms of a choice of time t and 
associated timeflow t^. There are an infinity of choices of such a t. One usually demands 
the spacetime to be time-orient able so that it is always possible to consistently allocate 
notions of past and future. For this (re, 0, -1) case t is truly temporal. In the other cases, 
to start off with one can think of the sequence of hypersurfaces being parameterized by an 
"independent dynamical variable" (IDV) fi. 

In the (re, 0; -1) case, the lapse may be interpreted as the change in proper time 
dr = a{t, x^)dt, a = n'^ta- The interpretation of the shift is the displacement in identification 
of the spatial coordinates between 2 adjacent slices, /3* = h^Hj. Analogous notions can be 
defined for the general case. 

Applying an "ADM-type" split of the metric splits the EFE's into two systems of 
equations to be studied as two separate steps. In the (re, 0; -1) case these steps are the GR 
Initial Value problem (IVP) and the GR Cauchy problem (CP). By problem I mean a p.d.e 
system that holds in some region together with prescribed data on (some portion of) the 
boundary dil.. 

Here I consider the general (r, s; e) case with phenomenological matter. One could 
instead have fundamental matter (see 1.2.3.4). In sufficiently complicated cases, the matter 
is governed by separate field equations which should then be adjoined to the EFE's to 
complete a coupled Einstein-Matter system. The "IVP" system consists of n + 1 constraints 
(4 for the usual GR): the Gauss and Codazzi "constraints" 

-KoK-eR = 2G±± = 2T1^ = 2p, (148) 



-eiDhK'a - DaK) = G. 



a± 



(149) 

These are obtained by use of the EFE's in (|144() and (|142() . They are "constraints" because 
they contain none of the highest derivatives with respect to the IDV /i. Their solution 
involves only the "initial hypersurface" . 

The remaining "^"^^"^^ equations (6 for the usual GR) are "evolution" equations with 
respect to // 



1 



a 



(S^Kab - eDbDaa) - KKab + ^Ka^K^c + eRab = eRab = e { S, 



S 



re — 1 



-h, 



ab 



P 



re — 1 



-h 



ab 



(150) 

obtained from the EFE's and H145() . Here p, ja and Sab = Tab (N.B S ^ T) are general 
matter terms which are usually prescribed as functions of matter fields that are governed 
by usually-separate field equations. A useful equation is the trace of (|15U() 



5pK = a{ -eR 



up + eS 
re — 1 



+ eD a = a 



KoK + 



{n-2)p-eS' 
re — 1 



+ eD'^a, (151) 



the second equality of which follows from the Gauss constraint (|148|) . 

An alternative ^Gab rather than ''Rab form for the evolution equations follows from the 



identity ((T17|) : 



a L 



e{5pKab - habSgK) - DbDaa + habD'^a 



K o K + K'^ 

+ e ( 2Ka''Kbc - KKab + ^ hab ]+Gab = Sab ■ 



(152) 

Note how the form (|15U|1 has simpler gravity terms but more complicated matter terms 
whilst the opposite is true for ()152() . 



I often use the decomposition of symmetric second-rank tensors S^f, into their trace part 

S = S.fe/i"'' (153) 



and their tracefree part = S^fe hat- 

n 

In particular if this spht is apphed to Kab, (!148j) and (|149j) take the forms 

oK^ + ei? + 2p = , 

n 

DbK^\ - "^DaK + eja = , 
n 

whilst the second form of ()151() may be rewritten as 

\5.K - eD^a) - ^ = o + {n-2)p-c 
a ^ n n — 1 

2.3 Analytic approach to the GR "CP and IVP" 

2.3.1 Cauchy problems 

For a given d.e system of order p to be solved in a spacetime region 0, for functions ipA, 
the Cauchy problem (CP) consists of the d.e system together with the ipA and their time 
derivatives up to p — 1 th order on some initial hypersurface T,q. It is required that the 
initial hypersurface be nowhere-characteristic i.e a genuine spacelike surface. One could 
alternatively work with characteristic hypersurfaces (1.2.12.4). 

I consider CP's for a number of physically well-motivated systems, in order to isolate in 
simpler examples a number of features of the GR CP. These second-order systems recur- 
ringly provide a test-bed for GR ideas and all the field-theoretical examples will furthermore 
be coupled to GR. I work from the viewpoint of the analytic method, stressing the impor- 
tance of what input to prescribe and what output to expect from the equations of motion. 
In other words, which p. d.e problems are meaningful for the equations that describe nature? 

In Newtonian mechanics, if one is prescribed the positions and velocities of all the 
particles at a given instant of time (the Cauchy data) then one can in principle predict^^ 
these at all future times by solving the second-order o.d.e that is Newton's second law 
(|33() . Existence, uniqueness and other important properties are well-understood for o.d.e's, 
although in practice limited knowledge of real-world initial data may result in the onset of 
chaotic behaviour. 

Most of the laws of physics are however represented by p.d.e's, which are harder to treat 
than o.d.e's. For example if one is prescribed a flat spacetime Klein-Gordon scalar <f and 
its velocity everywhere (the Cauchy data) within a compact region at a given time, one 
can in principle predict the wave and its velocity everywhere within certain future and past 
regions^^ by solving the Klein-Gordon equation H9U|) . As a first simple case, if the Cauchy 
data are analytic functions, one can assert the local existence and uniqueness of an analytic 
solution to the wave equation from the following theorem. 



(154) 
(155) 



(156) 
(157) 



Cauchy— Kovalevskaya theorem: Suppose one has A unknowns ^'a which are functions 
of the IDV fj, and of Z other independent variables xz- Suppose one is prescribed on some 

■^^Or retrodict - all the accepted dynamical laws of physics are time-reversal invariant. 
■^^I will be precise about what is meant by these regions in the next subsection. This is to do with the 
Klein-Gordon equation respecting causality, a feature shared by all the laws of physics below. 



domain Q the A p.d.e's of order k of form Q^^k)" = -^A for the Fa functions of ^u, xz, 
^'a and of derivatives of ^'a up to {k - l)th order with respect to n, where furthermore 
in 0, the Fa are analytic functions of their arguments. Suppose one is further prescribed 
analytic data ^'a(0, xz) = ^^''dAixz), ■■■ , ^g^(fc-?)'°' (Q? ^z) = ^'''^^dAixz) on some piece U of 
a nowhere-characteristic surface. Then this problem locally possesses precisely one analytic 
solution. 

The meaning and applicability of this theorem are elaborated in 1.2.3.2 and B.l. 

Complications arise in attempting to repeat the above simple approach for 
electromagnetism. Upon using E = —dA^ — A in the vacuum version of the inhomoge- 
neous Maxwell equations (|62[ I63|) . one finds that one of the equations contains no time 
derivatives of E whereas the other three do. This corresponds to electromagnetism being 
constrained. The presence of the linear vacuum Gauss constraint 9-E = is associated with 
the gauge freedom ()75() . Because of the constraint, electromagnetism is strictly everywhere- 
characteristic, but one usually forms the problem for the three evolution equations (|63j) 
subject to already-constrained initial data and refers to this as the Cauchy problem for 
electromagnetism. To set this up one requires 

1) the electromagnetic field and its velocity at a given instant throughout some region. 

2) To fix the gauge freedom. In Lorenz gauge ()1U6() the evolution equations are of the 
correct form to invoke the Cauchy-Kovalevskaya theorem. 

Note that for the theory to make sense nothing is special about the initial time. So one 
would expect the constraint to hold for all times. Fortunately, the evolution equations 

d 9E 

guarantee this happens: ^(9-E) = ^'"^ = d-d x B = 

i.e the evolution equations propagate the constraint. 

The above analysis can be repeated for the 4K vacuum flat-space Yang-Mills equa- 
tions, giving rise to K constraints 1)1251) corresponding to K gauge freedoms ()118() . and 3K 
evolution equations (|126|) . The K constraints are similarly propagated by the evolution 
equations. A Lorenz- type gauge fixing permits use of the Cauchy-Kovalevskaya theorem as 
before. The novel feature of these as compared to electromagnetism is that they are non- 
linear equations which vastly complicates their behaviour and solution (eg superposition is 
no longer of use). 

The EFE's manifest all the above complications, being a constraint-gauge system and 
nonlinear (in fact the EFE's are quasilinear). They also have further complications of 
their own: one is dealing with a theory of spacetime itself rather than working on a fixed 
background, which is considerably more difficult particularly from a global perspective (see 
2.3.6). I now consider the simpler results of the (3 ,0; -1) GR CP, most of which hold 
irrespective of both e and s. 

2.3.2 Simple signature-independent features of the GR CP 

First, it was soon realized (e.g by Hilbert |34()j and Birkhoff-Langer jSl]) that there are 
n + 1 restrictions on the system (4 for the usual GR). Thus there are n + 1 constraints 
()148H149l) . These were explicitly provided by Darmois [HHlinni, although it has escaped 
attention that Campbell also provided them.^^ The solution of the EFE's is then to be a 
2-step procedure. 

The "initial value" or data construction step is to construct data {hab,Kab) together 
with coordinate functions p and ja all on some portion Uq of Tq by solving the constraints 
(see 1.2.9). 

^^Darmois' EFE split used normal coordinates a — 1, (3 = Q while Campbell used a arbitrary, (3 — Q. 



The "Evolution" or embedding step is, given such data, to solve the evolution equations 
H137|) and 1)150(1 as a prescription for evaluating {hab, Kab) on some portion of a nearby 

i.e for a small increment of the dynamical variable /i. 
This is the logical order of the steps, but historically the second was tackled before the first. 
The issue of what input is to be prescribed and what output is to be expected has played 
an important role in developing the above procedure, through the works of Lichnerowicz 
|253j and (Choquet)-Bruhat [HHllE^inS) and through Wheeler's questions |375| 1376] and 
those works answering these questions (see 1.2.9). 

Second, provided that the constraints hold on some "initial" Uq, they are guaranteed 
to hold throughout the embedding spacetime region. This is most quickly seen from the 
contracted Bianchi identity (|29|) . Understanding of this is implicit in and it is made 
explicit in [001, standard knowledge today: together with conservation of energy- 

momentum, and then making use of the Einstein tensor projection form of the constraints 
nA = G\- fj^ and the EFE's: 

= VsG^-VBTf 



= -{GA^-fA^)+d,{GA'-fA')-{GA''-fA'')t^BL + {GB^-fB^)t''AL^nA • (158) 

Third, constraints come hand-in-hand with gauge freedoms. Thus n + 1 of the metric 
components are freely-specifiable: the coordinate functions painted onto the geometry have 
nothing to do with the geometry itself. This is the content of general covariance. I will 
explain how many applications are tied to particular, careful gauge choices. For the moment 
I just consider either the normal coordinate gauge {a = 1, Pi = 0) or the harmonic gauge 
coordinates such that V^x^ = or equivalently = g^^fsc PSI- 

Fourth, by a suitable choice of gauge one can ascertain by the Cauchy-Kovalevskaya 
theorem for admissible analytic data that there locally exists a unique solution to the 
evolution equations (|15U|1 . For this and further applications in the GR CP, the harmonic 
gauge is usually chosen. However, I make the following observations about the use of this 
theorem. 

1) It is the most basic theorem for p.d.e's. 

2) It is a very general theorem in the sense that it holds for all sorts of p.d.e's. This is 
reflected by the signature (principal symbol)-independence of the applications and by the 
irrelevance of what analytic function i<A is, in, e.g, extending vacuum proofs to include 
fundamental as well as phenomenological sources. 

3) Moreover, it is a very restrictive theorem in that it relies entirely on the functions 
being analytic, which seriously limits its applicability. Without the analytic functions, no 
extremally general theorems in the sense of 2) are known or plausible. 

Furthermore, the analytic functions are inappropriate for any causal study (i.e in any 
sort of relativistic physics) since they are rigid (a change in a small region of an analytic 
data set induces a causally-unwanted change of the entire data set). 

Thus the application of the Cauchy-Kovalevskaya theorem to GR was never a serious 
consideration in the mainstream literature. It would have been regarded as trivial because 
of observation 1) and not of serious interest because of observations 2) and 3). The point 
of the pre-1938 literature is that no better results than this were available, a serious lack. 

Fifth, whereas it appears to simplify the EFE's, the use of the normal coordinate gauge 
in the GR CP is discouraged because in practice it typically breaks down quickly. This 
follows from the normal gauge Raychaudhuri equation that follows from (|157|1 : 

k =K'^ oK"^ + R^^ > 0, (159) 

n 



where the inequahty follows from the definition of the strong energy condition: 

the general Raychaudhuri equation term RabW^u^ > V unit timelike , 

(160) 

by picking = = t^. To make (|160p manifestly an energy condition, one uses the 

T 

EFE's to obtain Tabu'^u^ > V unit timelike . 

n — 1 

(161) 

Integrating the differential inequality in (|159j) shows that if Kq > 0, then \K\ — > oo within 
a finite amount of parameter vr = along t^. This blowup is by definition a caustic 
and signifies a breakdown of the normal coordinates. However, for the (r, 1; 1) embedding, 

_ ^A -g spacelike. There is then no good reason for R±± to be always positive, so 
caustics may be less likely to form. 

Elsewhere in the literature there is an old and supposedly little-known embedding result 
[71 \ I266| I312j . However, I identified it |14[ I16j as partly following from the above results, 
and consequently argue that a number of its supposed modern applications are highly 
questionable (see B. 1.2-3). 

Campbell Magaard embedding theorem: Any n-space can be 'locally surrounded'^^ 
by (n + l)-d vacuum space (where both spaces in question are non-null but otherwise of 
fixed arbitrary signature, and are both analytic). 

This followed from Campbell splitting the vacuum (r, s, e) EFE's independently of Dar- 
mois' work that blossomed into the GR CP and IVP. As above, the given proof of the 
result is subdivided into an embedding or "evolutionary" step considered here and a data 
construction step, "Magaard's method", considered in 1.2.9.2.3. What Campbell offered 
as 'proof for the former was incomplete and indeed spurious by the Bianchi identity. All 
of the actual proof is contained in Magaard's use of the Cauchy-Kovalevskaya theorem, 
which is thus just a version of the fourth result above. Thus it is well-known for the GR 
CP, and follows for the other (r, s; e) embedding cases from the well-known insensitiv- 
ity of the Cauchy-Kovalevskaya theorem to (r, s; e). Thus this step is in fact obvious 
and well-known. Moreover, as argued above, it is trivial and inappropriate. Strictly, the 
Campbell-Magaard result is the vacuum case i.e Tab = 0, a fact which has been much 
vaunted, but its proof may trivially be extended to any (analytic) functional form of Tab 
jl4j (since the use throughout of the Cauchy-Kovalevskaya theorem is totally insensitive 
to changes in such subleading order terms, which merely contribute to -Fa)- Thus there is 
a 'generalized Campbell-Magaard theorem'. Moreover, the above results for the (n, 0; 1) 
GR CP case are protected and extended by a host of further theorems which, as argued in 
the next subsection, do not carry over to other signatures. There are also difficulties with 
"Magaard's method" given in 1.2.9.2.3 for s = 0, e = —1 and especially in B.l for s = 1, 
e = 1, where I conclude about this issue. 

2.3.3 Signature-dependent GR CP results 

The vast majority of serious results |M1 ESU El CHI IMl El ESI Hill for the GR 

CP turn out to be entirely dependent on the lower-d signature s = 0. In other words, the 
choice of methods which properly respect the difference between space and time is absolutely 
crucial. In this subsection I restrict to documenting the standard results. I explain why 
these techniques quite simply do not have any direct counterparts for other signatures in 
Part B. 



By B 'being surrounded' by A I mean that there is an embedding of B into A. What is meant by 'locally' 
here is not particularly clear in the literature, and will be developed in 2.9.2. 



To have a sensible problem in mathematical physics, one requires not just local existence 
and uniqueness of solutions but rather well-posedness. Whereas well-posedness always in- 
cludes local existence and uniqueness, it also always includes continuous dependence on the 
data, without which an arbitrarily small change in the data could cause an arbitrarily large 
immediate^'' change in the evolution i.e there is no guarantee of physical predictability from 
such a problem. For a hyperbolic-type system one further requires a domain of dependence 
(DOD) property to enforce a sensible notion of causality. If the data are known only on a 
closed achronal set S (that is, a piece of a spacelike hyper surf ace), then the evolution can 
only be predicted within a region T>~^[So] (the future DOD), defined as the set of points such 
that all past-inextendible causal curves through each point [represented by the 7's leading 
to the point p in fig 5] intersect Sq. In my opinion (substantiated in B.l) causality can 
effectively be studied only in settings where the IDV is time. 



Figure 6: Domain of dependence property. 

One should not claim to be able to predict too great a portion of the future. Given data for a hyperbolic 
system on a piece of a spacelike surface, one can predict the future only in the forward wedge (domain of 
dependence) within which all causal curves, i.e allowed paths of information flow, emanate from the data. 
Outside this wedge, for all we know, large disturbances could influence the future arbitrarily soon. 

Note that further grounds of insufficiency for the use of the Cauchy-Kovalevskaya the- 
orem are that it gives no control whatsoever over these last two properties. The analytic 
functions are also undesirable because they are too restrictive a function space^^ to cover 
a number of interesting studies (e.g involving boundaries, discontinuities or low differentia- 
bility upon approaching some spacetime singularities). Piecewise, rougher function spaces 
are required. 

Serious theorems for GR were obtained in 1938 by Stellmacher |343j . and more ex- 
tensively by Choquet-Bruhat in 1952 (jESl) and 1956 ([M])- These advances were tied to 
progress in the (signature-specific!) general theory of nonlinear hyperbolic p.d.e's. For 
example, Bruhat made use of the (n, 0; -1) EFE system (|148H150|) being of 

the correct quasilinear hyperbolic form £^^(x, ^I'a, Va^'b)VcVd^e = J^eix, Va^'b) 



function J^e are smooth) to enable use of Leray's theorem |2511 1363j , which guarantees local 
existence and uniqueness, and furthermore continuous dependence on the initial data and 
the DOD property. 

The above proofs of the four well-posedness criteria for the GR CP are now usually 
done using Sobolev spaces ISESl EZI GSH IHH EH EIZI- At first these lend themselves to 

■^"See e.g page 229 of ^i^: I do mean immediate, not an issue of chaos or unwanted growing modes (though 
well-posedness often also bounds the growth of such modes 145 ). 

Whereas Hawking and Ellis |177| argue that the choice of function space used to model nature does not 
matter since it is not experimentally-determinable and in any case is only an approximation due to QM, 
I would refine this argument to say that the difference between the analytic functions and other function 
spaces is important because of the rigidity inappropriateness. Beyond that, I do not know if the particular 
function spaces used to prove rigorous theorems about the EFE's and about the extendibility of spacetime 
|87| may be substituted in these applications by approximations based on other function spaces. In this case, 
it may be a mere matter of convenience: one wants to use whichever sufficiently general function spaces 
accessibly give rise to the strongest possible theorems. 



x. 





Figure 7: The bucket-shaped construction that motivates the use of Sobolev spaces for the Cauchy 
problem for the Klein-Gordon equation. 



less involved proofs than Bruhat's, although if one seeks yet stronger results the functional 
analysis becomes extremely unpleasant. That Sobolev spaces are appropriate follows from 
simple consideration |363j of the flat spacetime Klein-Gordon equation. Given data on a 
bounded region Sq of a spacelike hypersurface Eq, one can draw the future DOD I?+[S'o] [fig 
6)] which is the region affected solely by this data due to the finite propagation speed of light. 
One can then consider the values of and its first derivatives on St = V^ISq] H T,t,t > 0. 
Then using the construction in fig 7, Gauss's (divergence) theorem and energy-momentum 

conservation, / fAct^f+ I TAcl^f = / TAct^f , 

Jsq Jb Jst 

(163) 

and the second term > provided that the matter obeys the dominant energy condition 
(DEC): 

-f^u^ is a future-directed timelike or null vector V future-directed timelike (164) 
and that is timelike. Then the definition of the energy-momentum tensor gives 

/ [(?)' + (V?)' + m\'] < I [i^f + (V?)' + m\'] (165) 
J St J So 

Because each integrand is the sum of squares (which are necessarily positive), this means 
that control over the data on 5*0 gives control of the solution on St- The idea of a Sobolev 
norm is a generalization of these last two 'energy' integrals j87j . The Sobolev class 
has a bounded norm of this type including up to wth order derivatives. There is then the 
following theorem jl91l I87j . 

Hughes— Kato—Marsden Theorem: Existence is guaranteed for the n-d EFE's in har- 
monic coordinates if the Sobolev class of the induced metric is no rougher than H"~^^ and 
that of the extrinsic curvature is no rougher than . 

This is the roughest mathematics fully worked out to date. The limitation on roughness 
comes from the definition of products, which is a necessary complication because the EFE's 
are nonlinear. 

2.3.4 CP's for GR including fundamental matter 

Matter theories on a fixed curved spacetime are built according to rju — > gjj and the 
'9 — > V rule. This undeniably produces the simplest field theories, but it is not certain 
that these are the ones chosen by nature, since the particle accelerators by which we judge 



our field theories are located in a rather flat region. Thus the study of these theories is 
subject to our ignorance of nature's unexplored high-curvature regime. In particular, the 
'9 — > V rule could in principle be ambiguous \2T6\ (but for electromagnetism the order is 
dictated by current conservation) or not realized in nature due to putative further symmetry 
reasons (e.g instead of minimal coupling for scalars, one might argue for conformal coupling 
pr4j ). We note that some of these theories would violate some form or other of the POE. 
By the nature of many such theories (e.g Brans-Dicke theory), this possibility cannot be 
overruled by direct tests of the POE. 

The curved spacetime version of the equations of Klein-Gordon, Maxwell and Yang- 
Mills theory (see 1.2.5) remain of the correct form to invoke Leray's theorem. The coupled 
version of these fields with GR may be built by pairing the corresponding equations with 
the EFE's (|148[ I149L I15()|) sourced by the corresponding energy-momentum tensor pieces 
(|182|) . In each of these cases, the total Leray form is blockwise the GR and matter Leray 
forms. The GR Leray form is disrupted by some of the nastier matter fields. This goes 
hand in hand with some of the unpleaseantries of VI and VIII. Dirac theory is sufficiently 
different that I defer its treatment to VI. 4. 

2.3.5 Global results 

Global results for the Einstein evolution equations are considerably harder to obtain than 
local ones. The need for global results stems from the generic onset of singularities from the 
evolution of smooth data. Were naked singularities to arise thus, all the above nice notions 
of predictability would break down. Penrose 289 conjectured that naked singularities do 
not arise generically (the cosmic censorship conjecture). Whereas this has been studied 
for some simple cases, it is an open question. Among these simple cases are those with 
high-symmetry, particular asymptotics, or a 'small' departure from exact data |311j . 

2.4 Variational principles for GR 

I next consider the ADM split from the point of view of the principles of dynamics. The 
applications in this thesis requires this just for the usual (3, 0; -1) case. Here are some 
useful results for variation with respect to the (arbitrary-d) metric gab- 



W\9\ = 2 Vbl5"'55afe , (166) 

Sg"' = -g^'g^'Sgij , (167) 

Sr\c = ^5"'' NciSgdb) + VbiSgdc) - Vd{6gbc)] , (168) 

ST'bc = \g''''^b{Sgcd) , (169) 

6R\,d = 2V[rf| {6T\\,]) , (170) 

= 2V[,| (,5^1,]) , (171) 

6R = -R'^gij + 2V'{VihSgcf) ■ (172) 



Then variation of the Einstein-Hilbert action 
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d'^a^VM (^ + l-Matter) 



(173) 



easily gives 



= S\ 



I 



+ (^LMatter 



which upon discarding the divergence term and adopting 
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(174) 



as the curved spacetime definition of the energy-momentum tensor yields the EFE's. 
Note that the pure gravity Einstein-Hilbert action may be rewritten in split form as 



j dt j d?xaVh[R -e{KoK- K^)] = I3+1 



(GR) • 



(175) 



upon discarding a divergence. Variation of this yields the split form of the EFE's of 1.2.2. 
It is also used below in further GR principles of dynamics work. 



2.5 Inclusion of fundamental matter fields 

There is no difficulty in including simple phenomenological matter in the above. Likewise, 
there is no difficulty in including fundamental matter fields at least to start off with. I use 
the below in IV, whereas VI- VII discusses difficulties with more complicated fields. 
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The corresponding split field equations are 



Vh ( 

2a \ 



5-.A> -d^^ 
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^(5^A^-d^<^ 
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(178) 
(179) 
(180) 
(181) 



To make coupled Einstein-matter systems, use these together with the split 
Theory p ja 



KG -i 



Sab 

d'^qdci, - {6p>if + m2?2 

1 {E X B)a -\[BaB\ + EaEb - hab{E^ + B' 



+ \ha 



(182) 



e/m h{E^ + B^) ^ 

of the energy-momentum tensor^^ ^]jg gpij^ EFE's. 

^^Notc that the 00 components are field energy expressions, the Oa components are field momentum fiux 
expressions (such as the Poynting vector of electromagnetism) and the ah components are field stresses. The 
Yang-Mills energy- momentum tensor looks just like the electromagnetic one, with B\, El in place of S', 



2.6 The Dirac— ADM Hamiltonian formulation of GR 



In the case of (3, 0; -1) GR where the fohating spatial hypersurfaces are assumed to be 
compact without boundary (CWB), L = a^/h{R -\- K o K — K"^) so 
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dl 



dhab 2a 



(183) 



This can be rearranged to hcd = ^{hachbd - ]^habhcd)p°'^ + 2£'(c/3d) • 



(184) 



It is also worth noting that for GR 



p = 2VhK . 



Then by Legendre transformation 

H{hij,p^''; a,pa\l3m.,pj) =poh - L{hij, hki; a, a; f3m,Pn) 



2a ( 1? 
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d^xH 



(185) 
(186) 



According to the standard interpretation, a and Pi are Lagrange multipliers. Variation 
with respect to these multipliers gives the constraints TC = and Ti.i = 0. Thus the GR 
Hamiltonian is zero (for the CWB case; else there would be surface terms). But this does 
not mean GR is trivial, since its Hamiltonian is only weakly zero: 



Hgr = , but HT°jt-i = Hgr + / d^x(aW + P'H 



d?x{an + = Hadm (187) 



2oL f p 

Hamilton's equations are then the evolution equations S^hab = [pab — i:hab 



(188) 

~2 



The first of these is just a rewrite of H184|) . 

The constraints indeed propagate. In Poisson bracket language, using the basic 

gravitational bracket {/iij(x),p'^'(x')} = 5'^(j5'j)(5'^^) (x, x') 

(190) 

and the smearing out of the constraints by arbitrary functions /?/ = [a, so as to obtain 
undensitized scalar functions on the gravitational phase space 

nioi) = I d?xaH , Uia) = [ d^xP'Hi , (191) 



then {f{hj,p'''),nm = £pf . 

(192) 

Thus the momentum constraint's action is associated with dragging around within each 
hypersurface i.e with the 3-diffeomorphisms. From 1)1921) it follows that 

{n{p'),nm = £^n{p') , (193) 



{n{a'),n{(3)} = £pn{a') . (194) 
A little extra effort is required to obtain the Poisson bracket of two Hamiltonian constraints: 

{n{a),n{a)} = W(7) , y = hh'\adja' - a'dja) . (195) 

Because this last bracket involves the metric, this is no true Lie algebra (for which only 
structure constants would appear, see footnote 19). So although the 4-diffeomorphisms 
form a true Lie algebra and the 3-diffeomorphisms also, splitting up the 4-diffeomorphisms 
into 3-diffeomorphisms and 'other diffeomorphisms' becomes messy. Unlike the 
3-diffeomorphism Lie-dragging invariance associated with the momentum constraint, there 
is no manifest symmetry associated with the Hamiltonian constraint. There is instead a 
remarkable hidden symmetry from the perspective of the split formulation of GR: invariance 
under refoliation (or of choices of time function). The symmetry is hidden because in the 
split formulation one is working within a particular foliation, so that what happens under 
refoliation becomes obscured. It is not known if or how the time function and the true 
d.o.f's of GR may be disentangled. This is part of the Problem of Time (see L3.3). 

I next consider the passage to the Hamiltonian in the presence of matter fields |19| 
I23U1I231] ^, denoting the Hamiltonian and momentum constraints obtained in this case by 
^TC and ^'Hi, and the contributions to these from the matter fields by qrTi and ^Tii. For 
example, for Einstein-Klein-Gordon theory, Einstein-Maxwell-theory and Einstein- Yang- 
Mills theory, the field momenta are 

7r = — = ^{^-£p^), 196 
a? a 

TT* ^ |t = ^(i* - £f,A^ - d'Ao) , (197) 

""'^^h^ " " + ^cf\^AiAf) , (198) 



the Hamiltonians are 



Hkg = j d^xian + P'Hi) , (199) 

,W = ^ + Vhim^ + m,e) , (200) 
,rLi = Tid^q , (201) 



Hem = / d^x{an + n + A^g) , 



(202) 



(204) 



g = daTT'' = , (205) 

Hym(G) = J d^x{an + p'Hi + A^Gj) , (206) 

A,n ^ ^vr^i + ^F, o F' , (207) 

A.Hi = TT^iDiAl - D,A}) - AjD.TTi , (208) 

Gj = D^TT-j = , (209) 

and the evolution equations are 

■fr = VhD''{Nda<;) + m^? + £^7r , (210) 

jr' = L'jL'b'(Ml) + £p7r' , (211) 

4 = DfD^^^iNA^}) + £(^4 . (212) 
2.7 Superspace and geometrodynamics 

The evolution of a mechanical system may be viewed as a curve (parameterized by some 
label A) traced in the configuration space Q. For GR, naively the configuration space is 
the space Riem of 3-metrics on a manifold of fixed topology, taken here to be CWB. The 
evolution may then be viewed as a curve in Riem, i.e as a curve of 3-metrics. However, 
GR has constraints, so one would like to pass to a reduced configuration space in which 
these are satisfied. This would correspond to the space of true d.o.f's of the gravitational 
field. It is relatively straightforward to take into account the momentum constraint since it 
generates the infinitesimal spatial coordinate transformations, so that the true d.o.f's of 

T ■ 1 ■ n Riem 

GR lie withm buperspace = . 

Din 

(213) 

This is the space of 3- geometries, not 3-metrics: how one paints the coordinates onto the 
geometries does not affect the physics. Thus GR may be interpreted as a theory of evolving 
3-geometries, or 'geometrodynamics', as Wheeler put it j274u376) . It is however not known 
how to take the Hamiltonian constraint into consideration in order to pass to a fully reduced 
configuration space. 

2.7.1 Geometry on Riem and Superspace 

One can introduce on Riem the geometry associated with 



F=^J y"/^-''='''/iijA'^,^,V^d3xV/7d3x' (214) 

|108j where P^^'' is an invertible tempered distribution. One usually talks of the geometry 
in terms of the general supermetric Gfj'l^i'^^^ which is related to f'^''^'^ by Gf^^f^^^ = -^ajki 

where eijki is the inverse of f^''^ in that j d^x" ^/hl' f^^"'^" e^nj^,,^, f, = 6''^^,6^ f,)6^^\x,x') . 



I concentrate on the ultralocal supermetrics jlU6j .^^ These generally consist of three terms 
hikhji, hiihjk and hijhj^i. Thus there is a 2-parameter family of ultralocal supermetrics up 
to overall scale. However, as far as the usual applications are concerned, there is just a 
1-parameter family of these, 

[hii' ^ 



Gfjik = (huhjk - ^-j-hijhkij (215) 

because the supermetric appears in these applications in products of the form G^^^^SijSki 
for s^^ symmetric, rendering equivalent the hi^hji and huhjk contributions. The inverse of 

this supermetric is 0%^^ = Vh (h'^h^'' - Wh'^h''^ 



where X and W are related by X 



(216) 

2W 



3W-1 

(217) 

The inverse supermetric does not exist for X = | while the supermetric itself does not exist 
for = i. By use of DeWitt's 2-index to 1-index map jlU6j .^^ 

haa = h^ (no sum) , /las = -^h^ , h^i = -^h^ , hu = -^h^ , (218) 

one can see that for W < \^ the ultralocal supermetric is positive-definite and for W > ^ 
it is indefinite, having one 'timelike' direction. For = | it is degenerate. See III.l and 
VIII for implications of these different cases. 

Whereas all the above ultralocal supermetrics play a role in this thesis, the supermetric 



in GR is the DeWitt supermetric |lU6j Gijki = —j= ^h^^hj^k) ~ 2^ijhki ) (219) 



which occurs in H. Its inverse is G'-'*'' = Vh - h'^h'''- 

(220) 

(the symmetrizations in these expressions may be dropped in the applications mentioned 
above). 

All the ultralocal supermetrics have the following symmetries 

'^abcd — '-'bade > ^ '^bedi — ^ '-'bide ■ \^'^^) 

Additionally, the DeWitt supermetric has the further independent symmetry Gijki = Gkiji 
whereas its inverse has the symmetries G'^^^ = -G^^^^ = G^^"^^ . 

(222) 

As explained below, the above geometries give just pointwise geometries on Superspace. 
DeWitt showed that the natural geometry inherited on Superspace has barriers causing it 
to be geodesically-incomplete, although the decomposition 



^^A function is ultralocal in some variable (here the metric) if it contains no spatial derivatives of that 
variable. 

''''The new indices thus obtained run from 1 to 6 and are written as hatted capitals in this thesis. 



of (|219|) by use of C = y^h^, and any other five coordinates orthogonal to it has a 
geodesically-complete geometry associated with its 5 x 5 block. 

Further study by Fischer jl36j (see also DeWitt's accompanying commentary jlUHj l 
demonstrated that Superspace is no manifold. It is rather a collection of manifolds called a 
stratified manifold, in which the individual strata are manifolds. One must realize that the 
quotienting out of 3-diffeomorphisms depends on the particular properties of each 3-metric. 
In particular it is easily demonstrated that different 3-metrics hij have isometry groups 
lsom.{hij) of different dimension: dim(Isom((5jj)) = dim{Eucl) = 6, dim(Isom(generic hij)) = 
0. Thus, in a manner clearly related to the underlying symmetries, at different 'points' (i.e 
3-geometries) , Superspace differs in dimension. Thus it is not a manifold, but rather a 
collection of manifolds each labelled by the conjugacy classes in the 3-diffeomorphisms of 
Isom. This type of mathematical structure is to play an important role in 11- VIII. 

Two schemes have been proposed to overcome the incompleteness or non-manifold struc- 
ture of Superspace. First, DeWitt suggested jlU8j that when one reaches the edge of one 
of the constituent manifolds (where the next stratum starts), one could reflect the path in 
Superspace that represents the evolution of the 3-geometry. Second, Fischer later alterna- 
tively proposed jl37j to replace Superspace with a nonsingular extended space built using 
the theory of fibre bundles. 

DeWitt had yet another Problem with interpreting paths on Superspace |108| . Because 
the lapse is undetermined one is in fact dealing with whole sheaves of paths rather than 
with individual paths. 

Minisuperspace 

It is substantially simpler to consider the finite truncation of Superspace to the homogeneous 
geometries: Minisuperspace |271j . Then the above geometry is the full geometry rather than 
just the pointwise geometry, since the geometry is the same at each point. The restriction 
of the DeWitt supermetric to this case j272j is then just a 3 x 3 array rather than a 6 x 6 
array at each space point. Thus there are 3-1 = 2 d.o.f's rather than 6-3-1 = 2 
per space point. For later use, Minisuperspace includes nine Bianchi classes labelled with 
slanted Roman numerals there are also even more symmetric 1 d.o.f solutions (e.g the Taub 
universes) which then simultaneously belong to several Bianchi types. Minisuperspace is 
potentially an important testing-ground for classical and quantum cosmology/gravity, since 
it is conjectured that the behaviour of the general GR solution upon approaching the initial 
singularity is an independent Kasner universe at each spatial point. Belinskii, Khalatnikov 
and Lifshitz (BKL) conjectured mixmaster behaviour (a sequence of Kasner epochs as occurs 
in Bianchi XI spacetimes) at each spatial point (see e.g |48|l27HI273j ) whilst straightforward 
Kasner (Bianchi J) behaviour at each spatial point can sometimes occur jll61 119^ . There 
is growing numerical evidence for these behaviours jlUj. I mostly consider Minisuperspace 
as a toy model to test ideas against (see 1.3.3.3, V.l, VI. 1.4). Although one might try to 
extrapolate ideas about and techniques developed for Minisuperspace, this is often fruitless 
because these break down in the full Superspace. 

2.7.2 Geometrodynamics: old RMW interpretation 

How is the two-layered body of mathematics in 1.2.6 and 1.2.7.1 to be interpreted? Wheeler 
made a first attempt at this j374j . He perceived that GR could be viewed as a theory 
of evolving 3-geometries: geometrodynamics. The central object of this scheme is the 
still-remaining vacuum Hamiltonian constraint TC. Whereas the momentum constraint is 
conceptually (if not technically) easy to deal with, TC leads to the Problem of Time which 
plagues geometrodynamics. 7i is also central in the sense that since Ti = G±± = 0, if this is 
known in all frames of reference i.e choices of projection operator, then the EFE's Gab = 
are known (assuming general covariance): 'field equations without field equations'. Ti. is 



also central in Gerlach's formulation |152j . I will also discuss how Tii may be regarded as 
an integrability of 7i rather than on an equal footing to it (II. 2. 2. 4). 

Wheeler's early hope was that vacuum geometrodynamics might turn out to be a Theory 
of Everything. This was based on |274j . in which a number of other physical features were 
argued to be incorporable into this vacuum geometrodynamics. The properties of a distant 
isolated mass were argued to be recoverable from geons i.e gravity waves almost completely 
trapped in some region ('mass without mass'). Aspects of charge could be recovered from 
the mere topology of empty space ('charge without charge'). The electromagnetic field 
tensor^^ was shown to be expressible in terms of the energy-momentum tensor, from which 
the Einstein-Maxwell system could be interpreted as reducible to a theory of empty space 
alone. The first part of the latter result was a rediscovery of a result by Rainich j3U5j . The 
idea was then to investigate whether all other known forms of fundamental matter could 
be similarly incorporated into this Rainich-Misner- Wheeler (RMW) already-unified theory 
of gravity and electromagnetism. In those days the Yang-Mills- type theories of the nuclear 
forces were not yet known, so the RMW theory was a unification of all the understood 
fundamental forces. But no way to incorporate spin-^ fermion fields was found, the massless 
neutrino field presenting much difficulty for this conceptual framework. As a result of this 
already in 1959 |373j Wheeler thought it unlikely that vacuum geometrodynamics would be 
sufficient to describe nature. RMW theory also turned out to have a somewhat defective 
IVP |288j and as far as I know, nobody ever succeeded in writing down an action for it |274) 
or in recovering the superposition principle for electromagnetism. 

This interpretation was not a waste of time since it prompted the first exercises in the 
construction of 3-geometries as initial data sets, which underlies much of modern numerical 
relativity. 

2.7.3 Geometrodynamics: modern interpretation 

Wheeler asked some important questions about the Superspace interpretation of geometro- 
dynamics |376j .^^ The Hamiltonian constraint remains central but is to be understood to 
be with matter 'added on' j376j . The first question (quoted at the start of) is why the 
closely-related Einstein-Hamilton-Jacobi equation takes the form it does, to be answered 
without assuming GR. This thesis chiefly (II- VIII) consists of an answer to this question 
along the lines of the recent proposal by Barbour, Foster and O Murchadha. There is also 
an older rather different attempt to answer this question by Hojman, Kuchaf and Teitel- 
boim (see II. 1.1). One of the issues at stake with these answers is that they should past 
the test of natural inclusion of all the known fundamental matter fields. As in the RMW 
interpretation of geometrodynamics, it is the natural inclusion of spin-^ fermions that is 
troublesome. 

Second, what kind of structure does Superspace have? It was reported |345j that Stern 
proved that Superspace is Hausdorff-separated. There is also the indefinite DeWitt super- 
metric naturally defined on Superspace. There is than the issue of whether spatial topology 
change is permitted in classical geometrodynamics. It is not, as shown by Geroch |153j . 
So classically one must consider one separate Superspace per choice of spatial topology. 
Finally, is there any classical explanation for the 3-d Riemannian character of space and its 
ability to accommodate the known matter fields of nature. 

Third, what is two-thirds of Superspace? I.e how can the true d.o.f's in GR be rep- 
resented? This question was almost answered by York (see 1.2.9.4.2) and is also further 
discussed in III. 2. 

Note that Wheeler had in mind the case of evolving CWB geometries, which have the 
following merits and plausibilities. Einstein's arguments for a closed universe are based 
on its simplicity and on attempting to realize Mach's principle |127| I129j (it is displeasing 

Another difficulty is that only the non-null case of this was included. 
^^The quantum 'conceptual difhculties' in 1.3.3.3 are also Wheeler's. 



for 'absolutist' boundary conditions at infinity to interfere with local physics). Wheeler 
used the thin sandwich formulation (see below) to attempt to conceptualize classical and 
quantum geometrodynamics [3731 1375T I37tjj . and to implement Mach's principle |375| . The 
conformal IVP (also see below) was found to provide more adequate protection in the form 
of rigorous mathematical theorems than the thin sandwich, causing Wheeler later |199l 1377] 
to shift the interpretation of Mach to be instead in terms of constant mean curvature and 
conformal 3-geometry. This led to the idea of 'Wheeler-Einstein-Mach universes' [194j . 
A further simplicity argument for CWB geometries is than that the GR conformal IVP is 
simplest for these. 

Whether the favoured cosmological model for the universe is open or closed has shifted 
around during the history of observational cosmology. One should note however that 
observationally-open universes may close on a larger-than-observed scale e.g along the lines 
of Lindquist's gluing together of many Schwarzchild solution pieces |255| . or of topological 
identification j242j . Conversely, closed-looking could actually be open via hitherto unprobe- 
ably small holes leading to open regions. So it looks to be down to a matter of taste. 

Finally, the inclusion of simple forms of matter such as Klein-Gordon scalars, electro- 
magnetism and Yang-Mills theory does not disrupt the above interpretation of vacuum 
geometrodynamics (see IV, VI. 2). However, nastier matter fields do disrupt it (VI. 2. 4). 



2.8 The Baierlein— Sharp— Wheeler action for GR 

I will often use the manifestly reparameterization-invariant Baierlein-Sharp-Wheeler (BSW) 
|26j formulation of GR, which arose by analogy with QM transition amplitudes during 
Wheeler's attempts to interpret geometrodynamics. This formulation arises from rewriting 
the Lagrangian formulation of GR 

l3+i(GR) = / / d^xVha U+ J ^js = ^hikf^ji _ h'^h'i)S^hi,5^hM , (224) 



1 m 

by using the a-multiplier equation a = ± — \/ -77 , 

2 V it 

(225) 

to algebraically eliminate a from (|224|) (notice the analogy with setting up the homogeneous 
Jacobi principle). Thus (assuming R ^ everywhere in the region of interest) one arrives 
at the BSW action 

Ibsw = y y d^xVh^/RJ^ . (226) 
This formulation is the start of the first method we consider below for the GR IVP. 



2.9 The GR IVP 

GR is constrained. I now explore how the solution of these constraints has been approached. 
I include arbitrary dimension and signature to start off with, although I often then specialize 
to the usual (3, 0; -1) case, deferring treatment of the other cases to Part B where they 
are required. I include phenomenological matter. Several methods have been proposed, of 
which I only consider those which are general (i.e independent of symmetry restrictions). 
BSW proposed the thin sandwich approach (see 1.2.9.1). Although largely unexplored, 
there is evidence that this approach is of limited success. Two older types of method are 
the elimination and conformal methods (in 1.2.9.2 and 1.2.9.3 respectively). Elimination 
methods are intuitive in that the prescribed quantities are all physical, but rely on brute 
force and have a number of undesirable mathematical features. In contrast, the conformal 
method is counterintuitive in that the prescribed quantities are unphysical, but this method 
exploits well the mathematical properties of the constraint system in order to decouple 



it. Lichnerowicz proposed and argued for the conformal method in 1944 [2b'6\ . Bruhat 
compared conformal and ehmination methods in 1956 [H^, accumulating good reasons to 
favour the conformal method. However, I found that her arguments by themselves do not 
suffice to dismiss Magaard's 1962 method |266j and others. I fill in this gap below by 
extending her arguments. York and others substantially developed the conformal method 
in the 1970's |SHH1 EES EHOl HZSl HHD ESi ESZl ; this is commonly used as a cornerstone of 
numerical relativity (see 1.2.11). Further methods for the GR IVP are outlined in 1.2.9.4-5 
including a small work of mine. Fundamental matter is included in 1.2.9.6. 



2.9.1 The thin sandwich method and conjecture 



The thin sandwich method is a Lagrangian method: freely specify the metric hij and its 



1 / TS 

velocity hij. Eliminate a from its multiplier equation ^ — n 



2\l aR-2p ' 

(227) 



thus forming Its = j dt j d^xVh (^\/{aR - 2p)'\^ + (5if 

(228) 

where a = —e and p is the energy-momentum tensor component Tqo. Then treat the (5i 
multiplier equation (which replaces the Codazzi constraint) as a p.d.e for /3j itself: 



aR-2p 



h , (229) 



obtained using the explicit expressions for a and T^. The conjecture is that ()229|1 has a 
unique solution under a suitably broad range of circumstances. If this is true, then it is 
trivial to find a using ()227() and then to find Kij from its definition ()137() . This last step is 
'filling in the sandwich'. 

There are in fact two sandwich methods. The 'thick sandwich' involves prescribing the 
metric on two nearby slices and constructing the spacetime between them, whereas the 
other discussed above is the limit of this as the two nearby slices become arbitrarily close. 

As regards the conjecture, although progress has been made, a regular method to solve 
this has not been found and counterexamples exist. This is all for the s = 0, cr = 1 case 
with phenomenological matter. 

To avoid the 'Problem of zeros',^'' one assumes 2p — aR > in the region studied. Then 
solutions to the thin sandwich equation with Dirichlet boundary conditions are unique 
[461 143j . But one has no guarantee of existence, as shown by counterexamples in |46[ I392j . 

This is however not the end of the thin sandwich idea. See C.3 for the effect of considering 
an Einstein-Maxwell thin sandwich, and 1.2.9.4.1 for the modern conformal thin sandwich 
formulation. 



2.9.2 Componentwise methods, including traditional elimination methods 



2.9.2.1 A systematic treatment of componentwise methods 



The Gauss-Codazzi constraint system consists of n + 1 equations relating n{n + l) functions 
{hij, Kij). A simple idea is to treat this system componentwise: consider (n — l)(n+l) of the 
(/ijj, Kij) as knowns and attempt to solve the system for the remaining n + 1 components. 
Hawking and Ellis 11771 state that in the usual n = 3 case it is possible to solve this system 
for any such choice of knowns and unknowns. I investigate this below, but find it profitable 
to consider first what the different sorts of componentwise methods are. I attempt to cast 



the Gauss-Codazzi constraint system as some "evolution" system with respect to some 
auxihary IDV (without loss of generality xi). In this subsection, I simply attempt to cast 
the system in Cauchy-Kovalevskaya form, which would suffice to establish existence and 
uniqueness when the coefficients in the system and the 'data for the data' at the "initial" 
value of xi are all analytic. 

Some of the componentwise methods are algebraic elimination methods. For example 
when one considers an embedding of a given lower-d metric, then the Gauss equation is a 
mere algebraic expression in the unknowns (which are by default some selection of extrinsic 
curvature components). There are two such componentwise procedures. 
Method 1. If a diagonal component, without loss of generality Kn, is available among the 
unknowns, one can choose to interpret the Gauss constraint as an equation for this which 
is linear by the antisymmetry (|222|) of the DeWitt supermetric: 

2KnG'^''^K^^ + G^''^'"K^uKi^ + 0^""^^ Kuu,K^y = -{eR + 2p) (230) 

where u, w, x,y ^ 1). Thus if Z = G^^"^"^ K^w has no zeros in the region of interest, Kn can 
be straightforwardly eliminated in the Codazzi constraint, which one may then attempt to 
treat as a p.d.e system for the other unknowns. This method was criticized by Bruhat [6|] 
in the usual-dimension-and-signature case. These criticisms are discussed and extended in 
1.2.9.2.3. 

Method 2. Since there are n + 1 unknowns and only n diagonal components, it is always 
true that there will be a nondiagonal component, without loss of generality K12, among the 
unknowns. One may then interpret the Gauss constraint as a quadratic equation in this: 

G'^^^KuKu + G^^^'^KuK.a + G^^'^'^KabK^^ = -{eR + 2p) (231) 

for ah, cd ^ 12 or 21. So long as Z = G^^^^ = h}^h?''^ — h}'^h^'^ has no zeros in the 
region of interest, the solutions of the Gauss constraint may be substituted into the Codazzi 
constraint, and one may then attempt to treat this as a p.d.e system for the other unknowns. 
Method 3. If in contrast with the above two methods, there are metric components 
among the unknowns, the Gauss constraint may be treated as a p.d.e. for one of these. 
The resulting componentwise methods do not then involve any algebraic elimination. These 
are componentwise p.d.e methods. Note that p.d.e's rather than a mixture of p.d.e's and 
algebraic equations generally lend themselves to making stronger existence and uniqueness 
proofs. 

2.9.2.2 The result in Hawking and Ellis 

Claim (In 3-d) one can show that one can prescribe 8 of the 12 independent components 
of (/ijj, Kij) and solve the constraint equations to find the other 4. 

Although Hawking and Ellis 177 support this merely by citing where certain cases 
for the split into known and unknown components are considered, I prove this here for 
all cases in certain small regions^^ for the analytic functions. Because I proceed via the 
Cauchy-Kovalevskaya theorem, my proof is clearly signature-independent. Moreover, there 
are additional complications if the dimension is increased. 



The domain of applicability of each subcase is subject to one of the following kinds of 'Problem of 
zeros' of varying severity. First kind: in setting up the p.d.e system, if division is required by a function of 
unknowns (over which one has therefore no control) then the emergence of zeros of this function invalidates 
the procedure in regions to be discovered. Second kind: if the function is rather of knowns, one at least knows 
a priori in which regions the system will and will not be valid in. Third kind: the system is always valid, 
but the casting of it into Cauchy-Kovalevskaya form may involve division by a function which potentially 
has zeros, or more generally inversion of a matrix whose determinant may possess zeros. 



Proof It is important first to notice which di terms are present in the unehminated Gauss- 
Codazzi system since I am treating this as an "evolution" system with respect to xi. The 
Gauss equation is hnear in Ofh^w, quadratic in dih^i and hnear in dihu. The 1-component 
of the Codazzi constraint contains the di derivatives of all the variables, whereas the u- 
component (for any u ^ 1) contains only the di derivatives of K^a and of all the metric 
components. I next consider the different possibilities for the unknowns case by case. 

Suppose the 4 unknowns are all components of Kab- Then one may attempt to use 
method 1 or method 2, that is one eliminates some component E from the Codazzi constraint 
by use of the Gauss constraint. The 2-component of the Codazzi equation may be solved 
for a component, the 3-component of the Codazzi equation may be solved for a K^b 
component, and the 1-component of the Codazzi equation may be solved for any other Kab 
component, C. This never causes any trouble since there must always be such components 
among the 4 unknowns. The i?-eliminated Codazzi system may then be cast into a 3- 
equation first-order Cauchy-Kovalevskaya form, and the final unknown E is then to be 
read off the Gauss constraint. 

If just one of the unknowns is a metric component m and the other unknowns do not 
share an absent index (without loss of generality are not Kn, K12 and K22 which share an 
absent 3-index), then one can set up a 4- or 5-equation first-order Cauchy-Kovalevskaya 
form. The 5-equation form corresponds to when m is some huw so that the Gauss constraint 
is second-order and can then be decoupled via dim = y; the 4-equation form corresponds 
to the other possible choices of m. The Codazzi constraint may be solved for the 3 extrinsic 
curvature components. If there is a shared absent coordinate without loss of generality X3, 
then the corresponding 3-component of the Codazzi constraint contains no partial deriva- 
tives with respect to the dynamical variable of the extrinsic curvature unknowns and so the 
above scheme cannot be used. Then for the Gauss constraint, use method 1 or 2, giving a 
3-component Codazzi system. 

If precisely two of the unknowns are metric components, then without loss of generality 
one of the extrinsic curvature unknowns carries a 2-index. Then one may solve the 2- 
component of the Codazzi constraint for this component and the 1-component for whichever 
other extrinsic curvature unknown one has declared. Then the 3-component of the Codazzi 
constraint and the Gauss constraint are to be solved for the two metric unknowns. Thus 
one may obtain a 4 or 5-d first-order Cauchy-Kovalevskaya form. 

If there is just one extrinsic curvature component unknown, then the 1-component 
of the Codazzi system may be solved for it and all the remaining equations be solved 
for the unknown metric components. One then obtains a 5-equation first-order Cauchy- 
Kovalevskaya form. 

If all the unknowns are metric components there are no difficulties in writing down a 4- 
or 5-equation first-order Cauchy-Kovalevskaya form. For example without loss of generality 
a metric component huw must be among the unknowns (since there are 4 unknown metric 
components and only 3 can contain a 1-index). Then the Gauss equation is of the form 

(/i^/i^^ - h''^h'"^)didihuw = F{hij,dkhij,dkdihij with k, I not both 1, Kij,p) (232) 

which for h'^'^h^^ — h^^h^^ 7^ can be combined with the Codazzi equations arranged 
as equations for the d-derivatives of any n other unknown metric components to form a 
system to which the Cauchy-Kovalevskaya theorem is applicable [e.g decouple (|232j) to 
form a system of 5 first-order equations]. 

Finally, note that the above simple scheme, in which the Gauss-Codazzi constraint sys- 
tem is cast as an evolution system with respect to some auxiliary IDV, does not by itself 
suffice to investigate whether Hawking and Ellis' result holds for higher-d. For, consider the 
unknowns to be Kn, K12, K13, K22 and Then the 4-component of the Codazzi con- 
straint does not contain any 1-derivatives and so it is a constraint rather than an "evolution" 
equation with respect to the auxiliary IDV xi. 



2.9.2.3 Bruhat's criticisms and Magaard's argument 

Bruhat's first criticism of elimination method 1 j64j is that it is a non-covariant procedure. 
This contributes to it being highly ambiguous, since the nature of the prescription depends 
on the choice of coordinates. It is also ambiguous because there is no unique clear-cut 
way of choosing which components are to be regarded as the knowns and unknowns. This 
criticism holds for all the componentwise methods above. 

Bruhat's second criticism of elimination method 1 is that the eliminated system is not 
valid if the region of interest contains zeros of Z. Some form of this criticism holds for all 
three methods, although different kinds of the 'Problem of zeros' may occur. In all these 
cases, the occurrence of zeros indicates that the proof is not necessarily valid everywhere 
within a given region of interest. 

Whenever the first kind of Problem of zeros occurs, nevertheless data construction for 
small regions is permitted by Magaard's argument (originally stated for the particular case 
below). First, one is entitled to declare 'data for the data' on some (n — l)-d (partial) 
boundary set Xi = {xi = some constant}. For, although Z is a function of unknowns in 
the region of interest away from this set, these unknowns are declared to be known on the 
set itself, so one can choose to prescribe them there so that the resulting Z\x-^=c is bounded 
away from zero. Then by continuity, there are no zeros of Z near the set on which the 'data 
for the data' is prescribed. 

Magaard's particular method treats all the lower-d metric habS components as knowns 
because his aim was to prove an embedding theorem (the Campbell-Magaard theorem). 
Magaard's method is the special case of componentwise algebraic elimination method 1, 
in which after eliminating Kn by use of the Gauss constraint, the Codazzi constraint is 
treated as a p.d.e system for unknowns U = {Ki^, some other K^v component denoted 
E}. This satisfies the criteria for the Cauchy-Kovalevskaya theorem if one treats U' = {all 
the components of Kij bar Kn , Ki^ and E} as known functions on xi and provided that 
the p.d.e's coefficients and the data are analytic. So a unique solution exists. One can 
then group [2661 114j this method and the local existence of a unique evolution to form the 
Campbell-Magaard statement that a (r, s) spacetime with prescribed analytic metric hab 
may be embedded with an extra space or time dimension for any analytic functional form of 
the energy- momentum tensor. This statement suggests very many embeddings exist, which 
as argued below and in Part B, is a disaster for supposed applicability of these embeddings 
to build meaningful higher-d worlds in brane cosmology or 'noncompact KK theory'. 

Magaard's method leads to two prices to pay later on. Although the 'data for the data' 
(values of U on Xi = {xi = some constant }) can be validly picked so that Z ^ on 
Xi, I note that the protecting continuity argument is only guaranteed to produce a thin 
strip Q < xi < r] before zeros of Z develop. Now, the first price to pay is that one cannot 
expect to be able to patch such strips together to make extended patches of data. For, 
since the strip construction ends where Z picks up a zero for some xi = rj, while restarting 
the procedure with xi = rj in place of xi = is valid, the two data strips thus produced 
will have a discontinuity across xi = rj. So what one produces is a collection of strips 
belonging to different possible global data sets. The evolution of each of these strips would 
produce pieces of different higher-d manifolds. So the statement that an empty (n + l)-d 
manifold 'locally surrounds' any n-d manifold has a more complicated meaning than might 
be naively expected. Concretely, what has been proven is that any n-d manifold can be cut 
up in an infinite number of ways (choices of the xi coordinate) into many pieces (which are 
a priori undetermined), each of which can be separately bent in an infinite number of ways 
(corresponding to the freedom in choosing the components of Kab in on each set of 'data 
for the data'), and for each of these bent regions one thicken the region with respect to the 
extra dimension /x to find a piece of an (n + l)-d manifold. Furthermore, all of this can 
be done for every possible analytic function form of Tab (corresponding to the generalized 



Campbell-Magaard result) jl4j . This excess richness compromises attaching any physical 
significance to any particular such construction. 

I found three other indications that Magaard's method is not well thought-out. First 
it is not considered how far the 'data for the data' extends along xi = 0. Clearly the 
topology of the n-d manifold is an important input, for if it is not CWB, there is a missing 
boundary or asymptotics prescription required to make Magaard's method rigourous. Also 
the topology of the xi = set itself has not been brought into consideration. For example, 
the method does not guarantee continuity if xi = contains loops. It may also be that 
the coordinate condition xi = breaks down within the region of interest. In the original 
Campbell-Magaard theorem statement the word 'local' is left to cover these and yet other 
aspects! This makes the theorem less powerful in truth than in superficial appearance. 

My second point applies only to the s = 1, e = 1 case in Part B, for which the existence 
of an evolutionary region after the data construction step may easily be ruined by an 
'information leak Problem'. The usual signature case s = 0, e = —1 does not suffer from 
this thanks to the DOD property. Again this is an implicit limitation, in this case either the 
signature-independence of the theorem or of its physical applicability is found to be untrue 
(depending on how exactly the theorem is phrased). 

Third, Bruhat's second criticism still holds: Magaard's method lacks any n-d general 
covariance since it involves the choice of a coordinate xi and the elimination of a 11- 
component of a tensor. 

The small strip Problem and the second point above are badly convoluted for the Maga- 
ard method since the zeros of Z are not known until one has solved very cumbersome p.d.e's 
arising from the Codazzi constraint. This substantially inhibits the construction of specific 
examples of the pathologies and the study of how widespread they might be. Because of 
this, my proof based on method 2 supercedes Magaard's. Now, the zeros of Z are known 
from the start, making this a cleaner procedure, and also one for which counterexamples 
can be read off: any n-metric for which /i^^/i^^ — h^'^h}'^ is within the region of interest 
will do. However, for this proof Magaard's idea of guaranteeing no zeros on the 'data for 
the data' itself also fails because the zeros are now entirely controlled by knowns. What 
one has is a statement about existence in certain regions known beforehand. A yet more 
satisfactory method is provided in 1.2.9.5. 

2.9.3 The conformal method of Lichnerowicz and York 

The arguments of Lichnerowicz j253j and Bruhat [HI] led to the GR IVP taking a very dif- 
ferent route from the above sort of brute-force elimination methods. Lichnerowicz' method, 
which was much developed and extended by York |388l 13891 139UI . is instead preferred. 
Its treatment below is as far as possible for general (r, s; e), and then specializes to the 
(n, 0; -1) and particularly to the (3, 0; -1) case. 

In the conformal method one chooses to treat hij as a known metric which is moreover 
not the physical metric but rather only conformally-related to it by 



hij = ip'^hij 



(233) 



where the conformal factor ^ is a positive suitably smooth function. Then 



ab 



(234) 



h = i/j'^'^h 



(235) 



(236) 



^ _-pa ,nrjd^ 



R = ij- 



R + r]{n — 1) [ 1 — T] 



n-2\ |L>VP 



(238) 



A metric is conformally-flat if there exists a conformal transformation from it to the 
corresponding flat metric. Furthermore, when it exists (n > 4), the Weyl tensor serves as a 
conformal curvature tensor because it has the foUowing properties: 

1) it is conformally invariant W"'i,cd = W^bcd , 

(239) 

2) W^bcd = the metric is conformally flat . 

(240) 

Work in terms of the K"^ and K split, and permit K'^^^ , f and p to conformally 
transform according to K^'^^ = i/;'^''^'^ K^^^ , f = , p = i/^'^p , 

(241) 

whilst crucially demanding the constant mean curvature^^ (CMC) condition 

K = hypersurface constant, C(/u) (242) 
holds and is conformally-invariant. This includes as a subcase Lichnerowicz's earlier use of 
the maximal condition K = . 

(243) 

One then demands that the (raised) Codazzi constraint (|156j) is to be conformally- 



invariant. Since DaK^^'^^ = ip' 



C-2r? 



DaK^"'' +[C-2r] + r]--^ ) K 



" + 2\ j.TabDai' 



then one requires by H236|) that —rj — - — = C~2?7 = ^ . 

(244) 

Furthermore it is desired that the conformally-transformed Gauss equation (|155|) . i.e 
R-rj{n-l)^+r]{n-l) (l " ^^ + e(M^2C-r?_^2^r,^2pV^+'') = (245) 



by (I^HHl) . where M = o and m is proportional to K, should contain no I^^V'I term. 
;^,C = -2ande = -2i±i 



Thus, r] = ( = —2 and ^ = —2^^ [making use of (|244j) ]. Now, regardless of (r, s; e), 



provided that the (n + l)-d DEC is to be preserved by the conformal transformation, p^ 
must conformally-transform like j°'ja, implying that oj = —2^^. Then H245|) becomes the 
(r, s; e) version of the Lichnerowicz-York equation^'' 

L>V = V -^R - MV'" ~ + - 2p'il^~^~ (246) 

4 n — 1 V / 



^*One can instead (and in close parallel with the above) choose to work with momenta in place of extrinsic 
curvature. Most of the applications in thesis are formulated in terms of momenta. 



This is the same notion as in 1.2.1. 



39 r 

^''The main feature due to s is hidden in . 1 use rather A (the Laplacian, which is an elliptic operator) 
for s = and □ (the wave operator, which is a hyperbolic operator) for s = 1 to bring out this important 
difference in applications. 



for the conformal factor ■0. Lichnerowicz's original equation had no m term in it and no 
phenomenological matter term p; the study of these additional features was carried out in 
the usual (3, 0; -1) case by York and O Murchadha 

Now the solution of the Codazzi constraint is decoupled from the solution of the Gauss 
constraint. The former proceeds by a traceless-transverse (TT)-traceless-longitudinal (TL) 

splitting ,n!inii!M] 



(for some vector potential and for {LW the conformal Killing form), which along with 
the trace-tracefree split is conformally-invariant |391j and thus unaffected by the solution of 
the latter, which has become the p.d.e H246() for the conformal factor. The simplest case is 
j^Tij ^ j^TTij ^^i^j^ possible for f = 0. Then ah one needs is any K^^"^. The standard 
case is that K^^ is known and that one is to solve the Codazzi constraint as a second-order 
p.d.e for the potential out of which K"^^ is built. See C.2-3 for such procedures, along 
with tricks and simplifications for solving the Lichnerowicz-York equation. 

Once the Codazzi constraint has been solved, KJ^ is known so M is known. Then one 
can attempt to solve the Lichnerowicz-York equation, which is well-studied for the (3, 0; -1) 
CWB and asymptotically-flat cases. By the artful construction above, this is a quasilinear 
elliptic equation, permitting the approaches outlined in C.2.2. It has been studied including 
most fundamental matter fields jl98j . and for Sobolev spaces matching those then used in 



Note that the choice of using the scale-scalefree decomposition of the metric and the 
trace-tracefree and TT-TL decompositions of the extrinsic curvature are irreducible n-d 
generally-covariant choices, a decided advantage over the coordinate-dependent, ambiguous 
componentwise methods. This and the decoupling of the constraints in the conformal 
method are signature-independent. Some of the methods to solve the Codazzi equation 
can be used regardless of signature, but the usual study of the Lichnerowicz-York equation 
involves elliptic methods which are absolutely not generalizable to the ,3 = 1 case. 

One can attempt to preserve the maximal or CMC conditions away from the 'initial' 
hypersurface Tq, by solving lapse-fixing equations (LFE's, see L2.10). Such choices of 
slicings deliberately prevent unnecessary focussing of geodesies j2531 1335] . enhancing the 
practical longevity of the evolution (the opposite occurs for normal coordinates!). However, 
the conformal method is not absolutely general, for some spacetimes may not have any 
maximal or CMC slice to identify with Tq in the first place, while in others the maximal or 
CMC slicing cannot be maintained to cover the whole spacetime (see 1.2.10). Furthermore, 
results concerning this depend on the asymptotics assumed. However, in the usual 
(3, 0; -1) case, this method (and its variants) is widely accepted as a practical method by 
the numerical relativity community (see 1.2.11). The LFE study to date has relied on them 
being elliptic, and thus cannot be simply generalized away from s = 0. 

Finally, for s = there is the useful property that certain local data patches can be 
proven to suffice for the treatment of astrophysical problems, by building on the notion of 
DOD. As promised, this sort of technique is also applicable to protect pieces of local data 
obtained by elimination methods such as that of Magaard. 

2.9.4 Further approaches and related mathematics used in (3, 0; 1) case 
2.9.4.1 The conformal thin sandwich method 



It was these caveats, along with the rehance on s = of York's method and restrictions on the values 
R can take for some subcases of the Lichnerowicz-York equation, that made me suspicious of the supposed 
generality of the Magaard method. 




(247) 



the GR CP |M1IH1|. 



This [3941 1891 18t)[ I395j is a combination of conformal and thin sandwich ideas. There are 
5 unknowns, taken to be if: and ay = i^^oi, whereas hij up to scale, hj^ = ip~^hj- and 
the hypersurface constant K are taken to be knowns. As compared with the original thin 
sandwich scheme, 7i is now an equation for ^/^ while there is a new equation for the scaled-up 
lapse. Then one writes the definition of KJ- in the form 

Kl = ^[{\LOr - h"^^] (248) 

and one uses this in the Codazzi constraint (|156() to obtain an equation for One addi- 
tionally has the Lichnerowicz-York equation (|246|) and 



One solves these as coupled equations. One can then finally reconstruct the physical h^j 
and K"^^^ (and hence K^^) from (|137|) and (|248j) . The theoretical point of this formulation 
is that everything then scales correctly. 

2.9.4.2 Conformal Superspace 

In the CWB case, York showed that 'S of Superspace' may be taken to be Conformal 



Superspace (CS) |391| CS 



3 

Riem 



3-diffeomorphisms x conformal transformations 

(250) 

(see jl38j for a more recent mathematical study) . This is the restriction of Riem for which 
the momentum is transverse [Dip^^ = 0) and traceless (p = 0). 

The quotienting out of conformal transformations is based on the decomposition of arbi- 
trary 3-metrics hij into their determinant h and their scale- free part /i™'* = h~ihij. CS is 
essentially the geodesically-complete space arising in the split (|223jl . There is accumulating 
evidence that a case can be made for CS almost corresponding to a representation of the 
space of true d.o.f 's of GR. It turns out that to obtain this for GR one must adjoin a solitary 

global degree of freedom: the volume of the universe, ^ ~ j ^^^^"^ ' 

(251) 

which interacts with the infinitely many local shape d.o.f's represented by h^f^'^- I investigate 
theories on CS and 'CS+V in III.2. 

2.9.4.3 3-d Conformal tensors and the York 1971 formulation 

For dimension n > 3, the Weyl tensor exists and serves as a conformal curvature tensor. 

For n = 3 one may define instead the Bach tensor babe = 2 ^D[ci?;,]a — -D[cRhh]a^ 



(252) 



and form from it the Cotton-York tensor density y"^ = B°'^ = -^e"''^''h^-^bfde 

which is a conformal curvature tensor. 



(253) 



This is used first in York's earhest (1971) formulation of the GR IVP, which is cast in 
a form which looks closely-analogous to that of electromagnetism: 



DbB'''' = , DbE"^ = (254) 



B = E = (255) 

for E*^^ = hip'^^ the momentum density of weight |. However, instead of the physical gauge 
freedom of electromagnetism, what is physical here is the gauge fixed hy solution of the 
original (vacuum maximal) Lichnerowicz equation. 

2.9.5 A new elimination method in terms of irreducibles 

The route to avoid Bruhat's non-covariance criticism and the ensuing ambiguities of pro- 
cedure is to work not with components but with irreducibles. The conformal method uses 
the scale-scalefree split of the metric and the TT-TL splits of the extrinsic curvature. It is 
then declared that the n + 1 unknowns are the single scale of the metric and the TL part 
of the extrinsic curvature encapsulated in the n- vector potential Wi. 

With this hindsight I constructed an irreducible elimination method . Write the con- 
straints in terms of the trace-tracefree decomposition. Now consider the Gauss equation as 
an 



algebraic equation for the trace part K: K = a \/ K"^ o K"^ — aR + 2p , 

\ n — 1 

(256) 

and substitute this into the Codazzi equation to obtain 



D'k]^j - \l "^^Dj oK^ -aR + 2p = jj . (257) 



Now treat this as an n equations for the n unknowns KJi^ encapsulated in the n-vector 
potential Wi: 



Kf - ^Dj{eR + 2p) (258) 



where K and KJ^ are treated as functions of Wi, dmWn and knowns [which may be easily 
written down using (|137() and H247() ]. See C.3 for more general consideration of this equation. 
It suffices to say that it is quite a complicated equation, but then so is the thin sandwich 
equation or the conformal thin sandwich system. 

Here I note that if xi is declared to be an auxiliary IDV, then if two certain functions do 
not have any zeros in the region of interest, the system (|258jl may be cast into second-order 
Cauchy-Kovalevskaya form. Very quickly, isolating the relevant terms of the w-component 

of the Codazzi equation leads to dfWu = Fu{Wi,djWk', knowns) , 

(259) 

provided that division by Kh^^ is valid. The 1-component gives 

which upon use of ()259|) and provided that division by Kh^^^^-^ — K"^^^ is valid gives 

dfWi = Fi{Wi,djWk; knowns) . (260) 



This provides a method of proof of the Campbell result which is less prone to Bruhat's two 
criticisms since its Problem of zeros is of the third kind so at least the eliminated system is 
always valid, and the system is built in an unambiguous generally-covariant manner. It will 
still be prone to the information leak Problem if s = 1, but as a genuine GR IVP method 
(s = 0) it is protected by the DOD property. 

So whilst some aspects of the old thin sandwich scheme have resurfaced again in the 
modern conformal thin sandwich scheme, the above is an attempt to have some aspects of 
old elimination schemes resurface in a new method with enough of the old faults corrected 
to be of potential interest to numerical relativity. 

All quantities prescribed in my method are the physical ones. Although it is an elim- 
ination method, it is carefully thought-out: the eliminated quantity has no prefactor and 
hence no associated Problem of zeros invalidating the eliminated system in certain places, 
and the procedure is uniquely-defined and coordinate- independent. In my method, one is 
faced with a single, more difficult vector p.d.e in place of the simpler Codazzi vector p.d.e 
followed by solving the decoupled Lichnerowicz-York equation. 

2.9.6 The IVP with fundamental matter and for alternative theories of gravity 

Minimally-coupled scalars offer no complications. In the Einstein-Maxwell IVP one has to 
scale Ei = '>p~^Ei and pe = ip~^pe so that the electromagnetic Gauss constraint (|2fl5|) is 
conformally-invariant |198j . This idea is considerably generalizable, both to more compli- 
cated matter theories and to large classes of alternative theories of gravity jl981 1197j . I 
note however 1) that the preservation of energy conditions is not always applicable (also 
mentioned in [SFJ if complicated enough matter is included and 2) that higher-derivative 
theories remain uninvestigated and may cause significant difficulties if required for numerical 
relativity through having a more complicated IVP than GR. 

2.10 Maximal and CMC slicings 

This section is for the (3, 0; -1) case and in terms of momenta to match its application in 
III.2. 



The maximal condition for an embedded spatial hypersurface is p = . 



(261) 



2p 

The CMC condition is ry = — 1= = C(A) , a spatial constant . 

3v /i 

(262) 

In GR, one regards H261() and 1)262(1 as maximal and CMC gauge conditions respectively. 
From the scaling of the tracefree extrinsic curvature in 1.2.9.3, the tracefree momentum 

scales as p^^^ = <j)~^p^'^^ 

(263) 

since it is a density of weight 1. The original (vacuum maximal) Lichnerowicz equation now 
takes the form 81\4>+M4>~'^ -R<j) = , hM = pop>0, 

(264) 

whereas the vacuum CMC Lichnerowicz-York equation (|246|) is 

8A(/) + 7W(/)~^ + = , hM=p^op^> (265) 



It is important to distinguish between a single initial use of a slicing condition in order 
to find consistent initial data and subsequent use of the slicing when the obtained initial 



data are propagated forward. This is by no means obligatory. The EFE's are such that 
once consistent initial data have been found they can be propagated with freely specified 
lapse and shift. This is precisely the content of 4-d general covariance. 

If one wishes to attempt to maintain the maximal gauge condition H261|) . a must satisfy 

a 

the maximal lapse-fixing equation (LFE) —p o p—Aa = . 

(266) 

In this thesis, I call such equations as LFE's, because, since GR is not always be presup- 
posed, I do not always work in a context where the notion of slicing of GR-like spacetime 
makes sense. Being homogeneous, (|266|) does not fix a uniquely but only up to global 
A-reparameterization a — > f{X)a, where /(A) is an arbitrary monotonic function of A. 

Similarly, to maintain the CMC gauge condition during the evolution, it is necessary to 
choose the lapse a in such a way that it satisfies the CMC LFE 

The above lapse-fixing equations are for vacuum GR, following from the vacuum version 
of the first form of (|15H) . The versions with matter are likewise easily obtained from the 
full equation (|151|) . 

Maintenance of CMC slicing yields a foliation that is extremely convenient in the case 
of globally hyperbolic spatially-CWB spacetimes. The foliation is unique [HOI I267j . and 
the value of A increases monotonically, either from — oo to oo in the case of a Big-Bang to 
Big-Crunch cosmological solution or from — oo to zero in the case of eternally expanding 
universes. In the first case, the volume of the universe increases monotonically from zero 
to a maximum expansion, at which the maximal condition H261() is satisfied, after which it 
decreases monotonically to zero. In CWB GR, the total spatial volume cannot be main- 
tained constant except momentarily at maximum expansion, when Ty = -^j^ = 0. Thus, in 
CWB GR the volume is a dynamical variable. This corresponds to the maximal LFE not 
being soluble in this case (see C.l). The above properties of ry allow its interpretation as 
a notion of time, the extrinsic York time (see e.g [20 H I47j). 

The maximal and CMC gauges in GR exhibit the following gauge artifact known as the 
collapse of the lapse: a blowup in R (for example in gravitational collapse) tends to drive a 
to zero. Thus these gauges are singularity-avoiding. But not all singularities are avoided: 
Eardley and Smarr |117j found an example of Tolman-Bondi model in which gravitational 
collapse is too sudden (i.e the curvature profile is too spiked) so that the LFE does not 
manage to drive a to zero before the singularity is reached. Such singularity-avoiding gauges 
are not necessarily a good numerical strategy to use H^l because they cause other difficulties 
though stretching the numerical grid. The CMC gauge is also not always applicable: not 
all GR spacetimes are CMC sliceable, nor is a CMC slicing necessarily extendible to cover 
the maximal analytic extension of a spacetime |336j . 

Note that the above are only partial gauge fixings. The shift Pi is as yet unspecified. 
One numerically-useful choice for this is to use the shift given by the minimal distortion 
condition |336j . Another choice (which in fact generalizes this) is to obtain both a and /3j 
from solving the conformal thin sandwich. 

Finally, note that one is entirely free to use whatever gauge in GR. In each case the 
gauge can be treated as above in terms of equations for a and /?, (e.g see II. 3. 2 for the 
normal gauge, or _336 for the harmonic gauge, which turns out to be closely analogous to 
the electromagnetic Lorenz gauge). 

2.11 Numerical applications 

Solving the conformal IVP equations is central to the study of binary compact object 
data in numerical relativity. Often simplified versions of these equations are considered 



(as explained in App C). The simplest treatments predate York's work: the Misner |27L)j 
and Brill-Lindquist [51] multiple black hole data solve just the flat-space Laplace equation 
case. Lindquist |254j also treated Einstein-Maxwell wormholes in the two flat-space Laplace 
equation case. The Bowen-York data |393j solve another simple case (a system of flat-space 
Poisson equations). Use of the conformal thin sandwich formulation in data construction 
is currently popular partly because of its practical motivation, since unlike for the 
simpliflcation Kij = which is trivial because it implies p^^ = and momentum is conserved, 
hjj = is rather the condition for (quasi)equilibrium, which need not be maintained at later 
times. 

The motivation to consider more complicated data is physical and is to be tested by com- 
parison with gravitational wave signals expected to be observed over the next few years [H^ . 
The numerical relativity side of this will require reasonably long-lived simulations (relative 
to the orbital period) of evolution of realistic data. The viability of use of the conformal 
method alone requires the tractability of gravity wave emission within this formalism, for 
which there is recent evidence jl63j . Traditional binary black hole evolution simulations 
have been of head-on and hence axisymmetric collisions |334( I17j . Modern work attempts to 
study more likely collisions following from the more general modern data above. Excision of 
black hole interiors (see C.2.3) is currently favoured over use of singularity-avoiding gauges. 
Nevertheless, simulations to date have been numerically unstable on small timescales (see 
e.g @S])- The study of binary neutron star evolution is currently more fruitful |331j . 

2.12 Further formulations of the EFE's 

The EFE's have been studied in many guises for a number of different purposes. The 
guises may roughly be qualified by what geometrical object represents the gravitational 
field (metric or 'bein'), by how this is split (a matter of signatures) and how this split is to 
be interpreted (a matter of prescriptions), by how the split equations are pieced together to 
make systems of particular forms, and by how extra variables may be introduced. Among 
the purposes are both practical and theoretical aspects of numerics, and quantization. 

2.12.1 Metric formulations 

If one uses the metric, one may then apply the (3, 0; -1) ADM split. The resulting system 
()83II841 11881 I189() may then be modified to form the equivalent but numerically better- 
behaved BSSN system j45| by use of conformal IVP-like variables, adding constraints to 
the evolution equations and using f ^ as new variables. One can also form larger 'Einstein- 
Ricci' or 'Einstein-Christoffel' i4| systems, seeking for particular kinds of hyperbolicity, 
which give good theorems to protect the numerics. One could also use an ' Einstein- Weyl' 
system with extra 'electric' and 'magnetic' Weyl tensor variables. One such formulation 
is the threading formulation jl741 I132j in which the fluid flow congruence (rather than 
hypersurfaces perpendicular to it) is treated as primary, on the grounds that the information 
available to us as observers is on incoming geodesies and not on some spatial surface. One 
has then a 'deliberately incomplete' system if regarded from the foliation perspective. Weyl 
variables are also employed in |332j (used in Part B). 

2.12.2 'Bein' formulations 

One can start again using 'beins' Ea^ such that qab = ^a^^bA ' 

(268) 

or spacetime spinors. This is useful to accommodate Dirac fields and in the study of 
supergravity. The unsplit spinor formulation is useful in the study of exact solutions. One 
can again do a (3, 0; -1) split (see VI.4). Note that there are then additional 'frame 



rotation' constraints A somewhat different 'bein' gives Ashtekar variables (see below), 
of importance in quantization attempts. 

2.12.3 Ashtekar variables 

Pass from {hah, p"^) to a SU(2) connection A^*^ and its conjugate momentum E^'ab [which 
is related to the 3-metric by hah = —t'r{Ea^h)]-^'^ The constraints are 

DaE%B = 5aE%B + | [E^, E"] | ab = , (269) 

triE^'Yah) = , (270) 

tr(E"E"''Fab) = . (271) 

H269|) arises because one is using a first-order formalism. Note that in this Ashtekar for- 
malism pi] it has the form of an SU(2) Yang-Mills constraint. H27U|) and H269() are the 
polynomial forms that the momentum and Hamiltonian constraints respectively, where 
F*^ = 29[aA^j^ -I- |[Aa,A^]|^^ is the field strength corresponding to Aq*^. One can see that 
H27U|) is indeed associated with momentum since it is the condition for a vanishing Poynting 
vector. The Hamiltonian constraint (|271j) has no such clear-cut interpretation. 

It is important for the formulation that Aq*^ is a self-dual connection, which makes this 
formulation specific to dimension 4. One has more or less a map of the GR phase space into 
the Yang-Mills one, which is exploited below. 'More or less' means that the asymptotics 
used are different, and also that the GR phase space has been enlarged to include the 
degenerate metrics. Finally, the Ashtekar variables formulation is of complex GR. But one 
requires troublesome reality conditions (see 1.3.3.3) in order to recover real GR. Nowadays 
so as to avoid reality conditions, one usually prefers to work not with Ashtekar's original 
complex variables but with Barbero's real variables j27j . 

2.12.4 Formulations using different splits 

Finally, one need not stick to using a (3, 0; -1) or indeed (r, s; e) split. One could split 
with respect to null surfaces: the characteristic formulation |38flj . This may be useful 
numerically; it is well-suited to the study of gravitational waves. One might use a mixture 
of Cauchy and characteristic formulations for this purpose. The difficulty then is in how 
to match the two regions. One could also split in the '2 -|- 2' way (102) . in which the 
gravitational d.o.f 's are isolated as the conformal 2-metric. These are not yet fully explored 
options. 

''■^The capital typewriter indices denote the Ashtekar variable use of internal spinorial SU(2). tr denotes 
the trace over these. Da is the SU(2) covariant derivative as defined in the first equality of 12691 1. 



3 Quantum physics 



3.1 Finite systems 

This might be based on the postulates 

QMl Associated with each physical state s of a system, there exists a wavefunction tps 
which is a ray in a complex vector space V. 

QM2 Classical quantities A have associated hermitian operators A. The form of A in terms 
of the fundamental variables qp,, is supposedly reflected by the form of A in terms of qa, 
p^. 

QMS Measurements and probabilities: V has an associated inner product (i.p) 

< II >: y X y — > K so that V is Hilbert. This i.p admits a probabilistic interpretation: 

< V'l^lV' > is the expectation value of A, and < V'ilV'2 > is the overlap probability or 
transition amplitude between states 1 and 2. The i.p is required to be positive-definite 
to permit the probabilistic interpretation, and normalizablc. By the 'Copenhagen' inter- 
pretation, making measurements supposedly entails collapse of the wavefunction from a 
probability distribution of states to a particular state, so that the measurement yields a 
single real number (the eigenvalue of that state). 

QM4 The wavefunction unitarily evolves in time. For a nonrelativistic quantum system 
this evolution is given by a time- dependent Schrddinger equation (TDSE) 

H|^ >=idt\ip > . (272) 

This can yield a time-independent Schrddinger equation (TISE) for stationary systems 

HIV' >= ElV' > (273) 



by separation of variables. The Schrodinger i.p is < V'ilV'2 >= / dap — S'^'^il)i dn ^'2 • 

(274) 

The canonical procedure is to pass from Poisson brackets to equal-time commutation 
velations{ETCR's) \[qA , /]| = i^A^ , Hqa , Qb]] = \ {p'' , P^]\ = ■ 

(275) 

I will always choose to use the position representation in setting up quantum operators: 

d 

Qk = qk, P^ = ■ (276) 

Obtain the Hamiltonian in terms of these and solve for the eigenspectrum and eigenfunctions 
(wavefunctions) . To have the standard interpretation available, one must know which inner 
product to use. 

The semiclassical approximation, tied to Hamilton-Jacobi theory, gives a sometimes 
useful approximate view of wavefunctions peaking around classical trajectories. 

3.2 Infinite systems: quantum field theory 

If one attempts to use < V'i|V'2 >= / dap-^rj^'^tpi dn ^^2 

(277) 

in Klein-Gordon theory, the probabilistic interpretation is ruined because rj^'^ is now in- 
definite. Fortunately, if c and are promoted to operators, one happens to obtain an 
acceptable i.p (this is now a many particle interpretation) . Quantize canonically by impos- 
ing the obvious ETCR's, obtaining H and solving it e.g in close analogy with an infinite 
collection of uncoupled harmonic oscillators. 



Considering instead Dirac theory adds little: the fields being fermions, one imposes 
ETantiCR's, and being charged, one uses distinct oscillators for e~ and But the quan- 
tum theory of the electromagnetic field is complicated by its constraint. Although electro- 
magnetism is tractable, theories with constraints Cx generally lend themselves to ambiguity 
of procedure: should these be imposed classically (which may complicate the ETCR's), or 
quantum-mechanically as restrictions Cx\ip >= on the permissible wavefunctions? The 
latter case generally suffers from Cx\'4^ >= 7^ |[Cx,Cy]||^ >= unless one is lucky with 
the operator ordering. Interacting QFT's are treated as perturbative expansions in the 
coupling constants, leading to the QED, Weinberg-Salam and QCD successes mentioned in 



QFT has well-known technical difficulties. Corrections due to loop contributions may 
destroy classical symmetries: anomalies. Integrals may not be well-defined, requiring regu- 
larization. Also, some theories require renormalization (counterterms in order to get finite 
answers), whereas others are unsatisfactory in being nonrenormalizable. More details of 
naive renormalizability are used in IV. 

3.3 Approaches to quantum gravity 

3.3.1 Outline of need for and approaches to quantum gravity 

Given quantum physics holds for three of the four fundamental forces and for matter, 
it is unseemly for the involvement of gravitation to cheat quantum physics (see [TS] for 
references). Without further development, the EFE's read 



A fully quantum description would be more satisfactory. Whereas all everyday physics can 
be understood by neglecting either GR or QM (consider the spacetime curvature inside 
CERN or the Compton wavelength of Mercury) , Planckian regions where neither are neg- 
ligible are conceivable. This situation is thought to occur in the very early universe and in 
black hole physics (both associated with classical GR breakdown at singularities), as well 
as possibly in very fine detail of space or spacetime structure. 

Quantizing gravity is hard; one talks mostly of programs. Some have failed. Others have 
serious theoretical difficulties (see below). None has predicted anything that is testable so 
as to provide convincing evidence. Even if some genuinely theoretically-flawless theory is 
established, what of inequivalent quantizations of the same scheme? And one can never 
dismiss the possibility of entirely different flawless theories being subsequently discovered. 
Without confirmatory (and discriminatory) experiments, physics reverts from Galilean to 
Aristotelian, regardless of the blinding usage of modern mathematics. A lot of research is 
going into phenomenology! 

There are two traditional branching programs for quantizing gravity: the canonical 
(3.3.2) and the covariant (3.3.4). Their woes are explained below. Another classification 
of attempts is into 'top-down' which take the known laws and attempt to deduce quantum 
gravitational behaviour, and 'bottom-up' which guess at the constituent structure of nature 
and attempt to recover known laws. The former aims at respectful extrapolation, whereas 
the latter may benefit from (perhaps new, perhaps rich) mathematical structure assumed, at 
the price of present unjustifiability of such assumptions. Additionally, it may be incomplete 
or generalized QM or GR that is involved. 

Note that the above remain programs, as is made clear by the critical nature of many 
papers in the subject. There are mixed-ontology and nonstandard versions of these pro- 
grams, while other practitioners have had their reasons to consider completely different 
schemes |761 l2U2j . With the full theories being hard, toy theories (3.3.3-4) are frequently 
used to voice both innovations and criticisms. This thesis extends or stems from Barbour's 



1.1.7. 



classical geometry as curvature 




quantum matter expectation value 
as energy-momentum 




attempt toward quantization, which has closed-universe, known-law, canonical, traditional 
variable connotations. Innovation vies with criticism; old toys are used and new toys (or 
just possibly alternative theories) emerge. 

3.3.2 The canonical approach and the Problem of time 

Note how QM uses external time in QM4 whereas there is no such thing in GR.^^ So what 
would ETCR's now be? Should one seek to identify a time before or after quantization or 
not at all? With its Hamiltonian being zero, GR looks rather like a (E = 0) TISE, so is 
it frozen rather than unitarily-evolving? These are some aspects of the Problem of Time. 
GR is constrained, so should one quantize it before or after constraining?^^ There are 
also Problems with finding i.p's and observables, and operator ordering, regularization and 
defining measures, depending on what one attempts. And these Problems are interlinked. 
Conceptually, why is nature classical to good approximation? Semiclassical Hamilton- 
Jacobi interpretations run into 'what is a superposition of geometries?' Embeddability is 
surely lost. Is nature a spacetime foam leading to loss of causality and (or) permissible 
topology change j376j ? Also, the 'Copenhagen' connotations in QM3 of the centrality 
of measurement by classical observers cleanly external to the observed system make no 
sense for a closed quantum universe. One usually gets round this one by using rather a 
many-worlds interpretation rather than a wavefunction collapse interpretation |in6| 1172] . 

Choosing to work in traditional variables hab{x), p°'^{x), one requires a fixed choice of 
the 3-space topology. Choosing to quantize before constraining, I elevate these to operators 
and choose the configuration representation (analogue of the position representation)^^ 

hij{x) hij{x) = hi,{x) , p'^{x) p'\x) = " - i-r^'' . (278) 

Furthermore one can choose to pass from to standard ETCR's 

\[hai>{x),p'\y)]\ = i5^,,f''f''\x,y) . (279) 



The momentum constraint can then be chosen to read — ^^1^' >= 

dhab{x) 

(280) 

(ordered with hij to the left) which signifies that \^ > depends on the 3-geometry alone. 
The Hamiltonian constraint Tl becomes the Wheeler-DeWitt equation (WDE)^^ 

n\^l; >= " (Gabcd{hab{x))—-^—— - ^{hab{x))R{Kb{x))) "|^ >= . (281) 
V dhab{x)dhcd{x) ) 

This is manifestly timeless (it does not contain any i-^^ or '^J^^ ). One consequence of 
this is that there is no conventional Schrodinger or Klein-Gordon i.p. One could treat (|281j) 
as an equation on Superspace; the indefiniteness of the DeWitt supermetric then provides 

*''QFT in spacetimes such as Minkowski relies on timelike killing vectors which in turn are only present 
due to high symmetry, whereas generic solutions of GR have no symmetry. 

'^^ K third option is to take Jii into account classically and Ti into account quantum-mechanically: Super- 
space quantization. 

^^In everyday QM, experiment would settle any such ambiguities and support the correctness of theoretical 
technicalities employed. 

■^^I always make this choice in this Thesis. I use " " to denote heuristic rather than well defined tractable 
expressions. 

■^^This is written here with the p*^ to the right of the hij ordering, and the heuristics include a lack of 
consideration for regularization. 



a 'time'. This cannot permit a Schrodinger i.p due to the indefiniteness of the metric (as 
in Klein-Gordon theory). Furthermore for GR the Klein-Gordon implementation 

< *i||^2 >= ^ n / d-^A^^^ ^ ^2) , (282) 

is also blocked by the absence of several convenient features that Klein-Gordon theory hap- 
pened to have. GR has a lack of well defined -, + mode separation tied to nonstationarity 
and an indefinite potential R (unlike mc;'^) so that this i.p is not normalizable. Two other 
not generally successful |236j interpretations of the WDE are the semiclassical interpreta- 
tion (akin to Hamilton-Jacobi formulation, using Euclidean path integrals jl69j ) and third 
quantization (in which the wavefunction of the universe itself is promoted to an operator). 
Suppose instead that one can perform a mythical canonical transformation 

hi,{x),p'^{x) gA(x),P^(x);gr'(x),p*™<=^(x) (283) 

which separates out the embedding variables from the true d.o.f's. This is equivalent to 
finding an internal time (and exemplifies constraining before quantizing). Then in these 
new variables one has by construction constraints which look as if they will form a TDSE 
once quantized 

nA{q'r'{x),p'ri^y,QA{x),P^{x)) = Pa{x) - PA{qTi^),p'rixy,QA{x)) = O . (284) 

One can then employ the corresponding well-defined Schrodinger i.p. However it could 
be that these variables are nonunique and each choice leads to a distinct quantum theory. 
Besides not one such canonical transformation has been found to date: none of the internal 
time candidates considered has been satisfactory. These are intrinsic time (undeveloped 
j236j ). extrinsic time (e.g York time^^ |2341 12UI] or Einstein-Rosen time |225j ^. and time 
associated with matter fields (e.g Gaussian reference fluid |lfl5| 1240] I2f)3j ). 

Finally, one could accept the timelessness of GR. One could then try the naive Schrodinger 



interpretation (NSI) jUSl EHl ESHl, based on the i.p <^^\-^2>= 




(285) 

This permits the use of certain relative probabilities, but is conventionally regarded as of 
limited use since it does not permit answers to questions of evolution. It is further discussed 
in VIII. 1-2. An attempted refinement of this is the conditional probability interpretation 
j286j : the sum over histories interpretation jl72j may be viewed as another such, and has 
been further developed as the generally- covariant histories formalism j2U41 13231 12221 
An unrelated timeless approach involves perennials (evolving constants of the motion) |314j . 

One could approach canonical quantization instead using Ashtekar variables. For the 
original complex Ashtekar variables, the form of the constraints (|269I - 07T|) makes operator 
ordering Problems less severe. One then attempts to use the reality conditions to construct 
the i.p. However, Kuchaf j237j argued that unlikely hopes were being placed on these 
reality conditions. Real Ashtekar variables have since been adopted, which have not been 
comprehensively criticized. The loop representation j317| I147| I25j (which works for both 
complex and real Ashtekar variables) provides a natural regularization, and leads to a 
discrete spacetime picture j318j . Modern approaches include Thiemann's construction of a 
quantum spin dynamics Hamiltonian operator |3541 13551 1357j , and the spin foam approach 
|293j . Because degenerate triads are being allowed and then exploited to obtain these results, 
I am not sure whether one is actually still considering GR. Furthermore, the recovery of 
semiclassical space or spacetime is turning out to be difficult. 

**This is actually good as a time but its implementation soon turns sour in other ways - see VIII. 3 



The canonical approaches are based on the idea that quantizing GR alone is possible. 
Then matter can be 'added on' |241 1356j . This is a counterpoint to the stringy unification 
below. Indeed, for all we experimentally know, nature might hinge on specifically 4-d, non- 
unified, non-supersymmetric properties. Gravitation might play a distinguished role not 
through unification but as a universal regulator I356| . 

3.3.3 Minisuperspace quantum cosmology and other toys 

Minisuperspace (1.2.7.1) was conceived in the late 1960's by Misner |272j as a toy quan- 
tum cosmology. It is however a gross truncation, amounting to entirely suppressing the 
momentum constraint, leading to finite-dimensionality, and thus neglecting most of GR's 
technical Problems. It is not even a self-consistent truncation ^239 in that the Taub model 
behaves differently from the Bianchi JX model that contains it! And such truncations are 
not QM solutions since the uncertainty principle is disregarded in switching off inhomoge- 
neous modes. I use Minisuperspace as a preliminary testing ground. If a supposedly general 
statement fails to be true even for Minisuperspace... (see V.l, VI. 1.3). 

Quantum cosmology was revived in the 1980's jl73l 11651 1361j . More realistic Midisu- 
perspace models (simple infinite-d models) such as with Einstein-Rosen and Gowdy models 
(see e.g ^25, 162 ) have been studied. Recently, Bojowald IHSI has begun to investigate 
Minisuperspace quantum cosmology arising from quantum spin dynamics. As this is a 
full quantum theory, such truncations then actually do make quantum-mechanical sense; a 
discrete singularity-free picture is beginning to emerge. 

Minisuperspace is but one of many toys used to speculate about the behaviour of full 
GR. Other toys include parameterized fields jll21 123UI I231j , relativistic particles in curved 
spacetime |168j anharmonic oscillators j326| I24j . perturbations away from homogeneous 
models jl71j . 2+1 GR [TS], 1+1 dilaton-gravity , and strings instead of 3-geometries |2411 
I222j . See II, III and VIII for more theories which may be regarded as providing toys 
(some of which are new): strong-coupled gravities, Barbour-Bertotti-type particle models, 
and privileged-slicing conformal theories. Different toys are required to uncover different 
aspects of full quantum GR, whilst it should be borne in mind that no particular toy works 
well. Indeed extrapolation from toys is dangerous since many proposals that are tractable 
for a toy are specifically tied to non-generalizable features of that toy. 

3.3.4 The covariant approach and unification 

The idea is to split the metric into background (usually Minkowski) and small perturbation 
pieces: gAB = 9ab + 9ab- This clearly has spacetime and fixed-background connotations. 
One then tries to treat this as a QFT for a 'spin-2 graviton' g^^, which has particle physics 
scattering connotations. This suffers from nonrenormalizability. One way around this is to 
alter the gravitational theory! For example, in higher-derivative theory, renormalizability 
can be obtained but only at the price of nonunitarity j341j . 

Because the subject has moved on from here to be considered alongside unification, it is 
appropriate to first consider the unificatory input. It has been suggested in particle physics 
to try to use in place of SU(3) x SU(2) x U(l) a single Grand Unified Theory (GUT) gauge 
group e.g SU(5) or SO(IO). One has then less free parameters, e.g just one fundamental 
coupling constant, as well as predictiveness through novel (but to date unverified) particle 
processes such as proton decay. Good convergence of the coupling constants, the hierar- 
chy Problem (explanation of why the GUT scale of convergence is so much greater than 
the electroweak scale), and superior renormalization, may be obtained by super symmetry. 
This is the hypothesis that each observed bosonic and fermionic species has respectively a 
fermionic or bosonic superpartner species. Although none of these have ever been seen, if 
the hierarchy Problem is to be resolved in this way, the forthcoming generation of particle 
accelerators are predicted to see superparticles. Supersymmetry may be incorporated by 



passing to a minimally super symmetric standard model, or by considering super symmetric 
GUT'S. 

Now, local supersymmetry is one route^^ to supergravity |143j . This includes a massless 
spin-2 field identified as gravity, and 11-d is picked out by uniqueness arguments giving 
a single set of (super)particle multiplets. One then has the Problem of accounting for 
the apparent 4-d world. However, the old compactification idea of Kaluza-Klein theory^'^ 
whereby unwanted dimensions are currently unprobed through being curled up small, was 
then revived as a possible reconciliation. For a long time supergravity was believed to be 
renormalizable, but unfortunately this fails at higher orders |51j . 

However, the failed string theory of the strong force was investigated in a new light, 
and the closed string spectrum was found to include^^ a massless spin-2 field, which was 
then conjectured to be the graviton, leading to a background-dependent, 26-d theory free 
of renormalization, anomaly and unitarity difficulties jl64j . Another use of supersymmetry 
was then found to cure its tachyon Problem, trading the 26-d for the 10-d of superstring 
theory. But this was found to be nonunique (five string theories were found), and although 
perturbatively finite order- by-order, the sum of the orders nevertheless diverged. 11-d 
M-theory [3841 I358j comprises relations (dualities) between the small and the large, the 
weakly-coupled and the strongly coupled, thereby connecting the five string theories and 
supergravity. It is intended to be nonperturbative. It does not as yet exist as a theory. 
Another aspect of modern string/M theory is the inclusion of extended objects with more 
dimensions than strings: {mem) branes |299j . 

Here are some simple stringy regimes. Lovelock gravity (p 65) arises as a correction to 
GR, a Born-Infeld theory^^ arises as a correction to Einstein-Maxwell theory. To attempt 
particle phenomenology, traditionally the extra dimensions are compactified, typically into 
a Calabi-Yau space. The Hofava-Witten model [187j has large extra dimensions of AdS 
(anti de Sitter) form; there are analogous braneworld toy models of this form discussed in 
Part B. 

One concern is that string theoretic phenomenology suffers from severe nonuniqueness - 
there are very many ways of hiding the extra dimensions'^ (e.g types of Calabi-Yau space, 
or of embeddings with large extra dimensions), and the corresponding split-up of matter 
fields (including gauge group breaking) can also be done in many ways. Thus one obtains 
actions with many scalar fields, along with p-form fields. Another concern is whether some 
of the model actions are unlikely to reflect truly string-theoretic properties (also discussed 
in Part B). 



The analogue of the Fierz-Pauh route to GR. 

^°This is a unification of electromagnetism and gravity by means of a 5-d geometry with one Planckian- 
radius cylindrical dimension. It is considered to be a failure at the quantum level because of the particle 
mode excitations being of Planck energy and thus not realistic particles! Witten |382| suggested a larger 
Kaluza-Klein theory to accommodate strong and weak forces also; this can be done in 11-d, with more 
complicated compact topology than cylindrical, to reflect the standard model gauge group. 

Alongside this, it contains a non-minimally coupled scalar dilaton and an antisymmetric form-field. 
Gauge fields are to be included on the ends of open strings. 

^■^The general Born-Infeld Lagrangian density is L = /{FabjQab), whereas the stringy case is the specific 
form L = ^ydet{g + F). 

^■^ Although neglected, one way out would be to have alternative 4-d string theories (e.g |140| or maybe 
using Liouville strings). 



A The 3-space approach 



II Answering Wheeler's question 

"// one did not know the Einstein-Hamilton-Jacobi equation, how might one hope to derive 
it straight off from plausible first principles without ever going through the formulation of 
the Einstein field equations themselves?^' John Archibald Wheeler j376j 

The GR Einstein-Hamilton-Jacobi equation is made by substituting p^^ = into the 
Hamiltonian constraint Ti. This is a supplementary constraint equation (|56)) rather than a 
Hamilton-Jacobi equation H55() per se.^^ 

Thus the first stage of answering the question above involves finding a derivation of the 
GR form of the Hamiltonian constraint 

n = G^jkif'p''' - VhR = , (286) 

where Gijki = {hi(k\hj\i) — \hijhki) is the DeWitt supermetric. ADM got this form 
precisely by starting off with the 4-d EFE's and performing their 3 + 1 split followed by the 
passage to the Hamiltonian (third route). Wheeler's question is about not only the reverse 
of ADM's work (fourth route), but also about whether first principles can be found upon 
which this is to rest. Because Wheeler listed six routes, Hojman, Kuchaf and Teitelboim 
(HKT) called their derivation from deformation algebra first principles the seventh route. 
I take "seventh route" to mean any route which leads to geometrodynamics regained from 
first principles. 

In addition to the HKT approach (H.l), there is now also a different 3-space approach 
(TSA ) started by Barbour, Foster and O Murchadha (BFO) (H.2),^^ which entails a distinct 
ontology (see H.3). In brief, HKT presuppose spacetime whereas BFO presuppose space 
alone. 



1 Hojman, Kuchaf and Teitelboim's answer 

The constraints Ti and Tii of GR close as the Dirac Algebra^^ 

{n{x),n{y)} = n\x)5,,{x, y) + W{y)5,,{x, y) 

{n^{x),n{y)}=n{x)5,i{x,y) (287) 
{Hi{x),nj{y)} =ni{y)6j{x,y) +nj{x)6^i{x,y) . 



This was originally derived by Dirac for slices in Minkowski spacetime |lfl9) and then DeWitt 
jlU6j established by brute force that it holds for general spacetimes. Then Teitelboim 
|349j showed that the Dirac Algebra may be geometrically interpreted as the embeddability 
condition.^'' 

The HKT idea |186j is to consider the algebra of deformations of a spacelike hypersurface, 

{W^(x), H^(y)} = n''\x)6,^{x, y) + n''\y)5,,{x, y) (288) 
{ni{x),n\y)] =n''{x)5,,{x,y) (289) 
{nf{x),n'^{y)} = nf{y)6,j{x,y)+n'j{x)5,,{x,y) , (290) 



^''it would become a Hamilton-Jacobi equation however were it reformulated in terms of an internal time. 
^^N.B I distinguish between TSA the program name and BFO the first attempt. HKT is a consolidation; 
I hope to extract and publish a consolidation of the TSA from this thesis ;9,- 
^^This was already presented in a smeared way as (1193141951 . 

^^Equivalently one can talk of the evolution of the 3-geometry being path-independent (in Superspace), 
also known as foliation invariance. 



as primary i.e as their plausible first principles. TL'^ is a pure deformation (see fig 8), whereas 
Tif is a stretching within the hypersurface itself. In doing so, HKT are following Wheeler's 
advice of presupposing embeddability into spacetime in order to answer his question, since 
HKT's first principles encapsulate embeddability. HKT next demand the 'representation 
postulate': that the "H*"*^', 7^*^"^^ for a prospective gravitational theory close as the "H^, 
Tif do (i.e as the mathematical structure commonly known as the Dirac Algebra, but now 
regarded as emerging as the deformation algebra) . 




deformation = stretching of coordinates + pure deformation 



Figure 8: 

Their derivation spanned several years and a number of half-way-stage papers |1841 
I228L I185j . in which various other assumptions were removed. The further time-reversal 
assumption of |186j was removed in j227j . 

The form of Tli essentially follows from it being a tangential deformation within each 
hypersurface, so one is quickly down to Wheeler's question about Ti alone. From deforma- 
tional first principles, 

W^ix') hij{x) = -2K,j{x)5{x,x), (291) 

from which it follows that any representation of l-f^ must be ultralocal in p^K From de- 
formational first principles, (289) holds which implies Ti'^ is a scalar density of weight 1. 
One can also deduce the form of (|290)) from deformational first principles. From this it 
follows by use of induction that T-C^ is quadratic in p^^ and also, in addition using Lovelock's 
theorem^^, that the potential is aR + K for cr = 1 (it works equally well for a = —1 but one 
is then presupposing embeddability into a Euclidean signature geometry). I emphasize that 
these proofs follow from the elements of spacetime structure present in the mathematics of 
the deformation algebra. I discuss an unsatisfactory residual assumption associated with 
these proofs in VI. 1.5. 

Recollect that since the hope of pure geometrodynamics being by itself a total unified 
theory along the lines of RMW theory has largely been abandoned, asking about the form 
of Ti translates to asking about the form of which includes all the known fundamental 
matter fields, ^ . So given any seventh route to relativity from some first principles, one 
can assess whether these first principles are truly plausible by seeing if some form of them 
naturally extends to permit a route to relativity with all the known fundamental matter 
fields 'added on'. For HKT, the representation postulate idea extends additively (at least 
naively) to matter contributions to 7i and 7ii. In this way Teitelboim j351j included elec- 
tromagnetism and Yang-Mills theory. One must note however the absence of spin-^ fields 
from this list j374j . without which the HKT seventh route is not yet satisfactory by our 
criterion. 

Kouletsis j221j has recently worked on a variation on HKT's work (related to the 
generally-covariant histories formalism) which makes additional explicit use of spacetime 
structure so as to clarify the spacetime origin of HKT's postulates. In this thesis, I do the 
opposite: rather than introducing yet more spacetime structure than HKT, I attempt to 
use less, along the lines of the next section. 

^^Thus the result is dimension-dependent. Just as the Einstein-Hilbert action is a topological invariant 
in dimension n = 2 but leads to nontrivial theories for n > 2, the Gauss-Bonnet topological invariant in 
dimension n = 4 may be used as a nontrivial action for n > 4, and this sequence continues for all higher odd 
dimensions. The gravity resulting from any linear combination of these actions obeys the same simplicities 
as GR. Thus for n > 4 there are Lovelock gravities other than GR. Teitelboim and Zanelli have shown that 
Lovelock gravity's constraints also close as the Dirac Algebra |353| . 



^ 

2 Barbour, Foster and O Murchadha's answer: 
the 3-Space approach (TSA) 

Barbour, Foster and 6 Murchadha (BFO) j38j require mere closure in place of closure as 
the Dirac Algebra. Whereas HKT's requirement that the Dirac Algebra be reproduced 
imports the embeddability into 4-d spacetime, BFO have been able to show that this is 
largely unnecessary.^^ For they were able to derive GR from 3-d principles alone, by use 
of Dirac's generalized Hamiltonian dynamics to exhaustively provide a highly restrictive 
scheme (see ^U] or 1.1.2.3). Thus the two answers advocate distinct ontologies: HKT 
assume the structure of 4-d spacetime (implicitly within the embeddability assumption 
rather than explicitly), whereas BFO adopt a TSA i.e they assume the structure of space 
alone. 

I begin by considering the 'Relativity without Relativity' (RWR) paper in which BFO, 
starting from their 3-space point of view, obtained new insights into the origin of both special 
and general relativity. Furthermore, when matter is 'added on', the null cone structure and 
the Abelian gauge theory of electromagnetism are enforced and share a common origin. 
The TSA also gives rise to new theories of evolving 3-geometries: strong gravity theories, 
and conformal gravity theories. 

My first four contributions to this TSA program are: first the tightening of RWR in this 
section based on ^1 151 IH] and additional original work, thus making the pure gravity TSA 
more complete and correct. Second, the discovery and investigation of alternative strong and 
conformal theories of evolving 3-geometries (III and V). Third, the inclusion of fundamental 
physical fields (IV) , extending RWR to the case of many interacting 1-form fields in which 
case Yang-Mills theory is recovered, observing how matter couples to strong gravity, and 
demonstrating that the viability of conformal gravity is not threatened by preclusion of well- 
established fundamental bosonic fields. I adopt a systematic approach in which it suffices 
to treat the matter terms piecemeal and independently of the gravitational working, thanks 
to working at the level of the general matter ELE. All cases of interest can then be quickly 
retrieved from this working. Fourth, I criticize the TSA as conceived in RWR (VI). This 
is necessary for practical reasons and for including spin-| fermion fields. Then I provide a 
new interpretation of the TSA which may be viewed from within Kuchaf 's 'split spacetime 
framework' (SSF) (or hypersurface framework) \1'1S)\ 12301 I23H I232j . which furthermore 
permits a quick diagnostic for what can be easily put into a sufficiently broad-minded TSA 
formulation to replace RWR. Thus it is demonstrated that there is a TSA formulation which 
permits the inclusion of spin-^ fermions and indeed of a set of fundamental matter fields 
rich enough to contain the Standard Model, i.e the simplest collection of matter fields that 
suffices to agree with observations, at a classical level. 

More recent contributions of mine to the TSA program, on the relation to the relativity 
principle and to the equivalence principle and the accommodation of massive 1-form fields, 
appear in V and VII. Ill is partly driven by preliminaries to quantization, which are revisited 
in the quantum TSA chapter VIII. 

2.1 The Barbour— Bertotti 1982 theory and best matching 
2.1.1 Machian point-particle theories 

I discuss here the origin of the ideas behind the TSA. As these date back to the inception of 
Newtonian physics, it is appropriate to begin by considering them in the context of particle 

^^In this section, this means unnecessary for the purpose of retrieving vacuum GR. Matter is considered 
in later chapters. 



mechanics. The ideas are relational as opposed to absolute. They are rooted in Leibniz' |249j 
'identity of indiscernibles' and were subsequently considered by Bishop Berkeley and 
Mach \2<jb\ . Whereas the ideas are sound, it counted heavily against them that nobody was 
able to construct any theories which implemented them. However, Barbour and Bertotti 
found two such point particle theories [361 I37j (BB77 and BB82). Whereas BB77 has 
experimental problems, BB82 does not, and its best matching framework may furthermore 
be used to formulate gauge theories and theories of 3-geometries (i.e of gravitation from 
a geometrodynamical viewpoint). As Kuchaf pointed out |233j . the 3-geometry case of 
the BB82 model pretty much gives GR in what is more or less identifiable with the BSW 
formulation and with a close analogue of the Jacobi principle. So it gives a means of 
showing from a certain perspective that the direct implementation of Machian principles 
leads to the entirely orthodox gravitational physics of GR! However, Barbour and Bertotti 
perceived this as a disappointment, causing the program to stall for many years. But in 
fact they had stopped just short of a large set of interesting results in traditional-variable 
geometrodynamics. 

The Machian ideas for point-particle theories are 
Rl In particle dynamics, only the relative distances between the particles are physically- 
relevant. 

R2 Time is nothing but an arbitrary monotonic parameter A used to label the sequence of 
relative configurations that the universe passes through. 

Note that some relationalists only consider the first demand. Temporal relationalism 
was emphasized alongside spatial relationalism by Mach j265j . As we shall see below, the 
common supposition that distant masses govern local inertia (c.f the motivation behind 
Brans-Dicke theory jSH], or arguments that in some sense GR possesses non-Machian so- 
lutions) only sometimes arises from our adopted starting point: for particles it happens in 
the BB77 implementation, but not in the BB82 one. 

Thus mathematically, given n particles indexed by (i) whose positions are described by 
q^^^, spatiotemporal relationalists should demand actions that are invariant under the 

transformations of the 'Leibniz group' q,^, — > A(A)q,^, +g(A) , A(A) orthogonal 



A /(A) . (293) 

The transformations (|292)) . which would suffice for purely spatial relationalists, form the 
Euclidean group Eucl i.e the kinematic group of Euclidean space jll9j . To be more precise, 
I take Eucl to comprise the group of small isometrics of Euclidean space. Note that it is 
from Eucl that the notions of distance and angle derive, which are the building-blocks for 
the relations. 

Invariance under these transformations is termed the kinematical principle of relativity 
j371j , which is notably different from the dynamical principle of relativity [invariance under 
the transformations (|34|) of the Galileo group]. Newton's point of view on the mismatch 
between the invariances 1)292(1 of the interactions and (|34|) of the inertial forces was that 
it proves the existence of the absolute structures that he postulated |28()j . However, Mach 
conjectured that a completely relational physics of the whole universe could give effective 
local physics that is invariant under ()34|) alone. Ehlers |119j claimed that Leibnizian space- 
time has insufficient structure^*^. However this is not correct according to one interpretation, 
since Barbour and Bertotti then provided a satisfactory Leibnizian theory. 

The transformation (|293() adjoined to H292|) expresses the reparameterization invariance 
(RI) of the time label, in other words the absence of a meaningful absolute time. The origin 

^''Lange |247| and Cartan I8U| spacetimes have more structure. The former was not constructive in that no 
specific physical laws fitting the description of such a spacetime were postulated. The latter has Newtonian 
mechanics merged with Newtonian gravity: as in GR, the gravitation provides the inertial frames. 



of Barbour's claim [6Ul I34j that the laws of physics are timeless is that they appear to be 
expressible in terms of only relative times, the content of which is nothing but relative 
distances. This may be illustrated as follows. The RI Jacobi principle of Newtonian me- 
chanics gives the shape of the dynamical orbit of the whole system, but if one is considering 
the universe as a whole, then none of the physically-relevant observers (i.e those internal 
to the system since by definition there is nothing external to the system) could notice any 
difference at all if the rates of all motions in the universe were to simultaneously double. 
For example, if one considers the solar system (neglecting for the moment planet-planet 
perturbations) as a simple whole universe model, the answer to Jacobi's principle is then 
several confocal ellipses when represented in K^. Now, an internal observer may indeed allot 
a rate at which one of these ellipses is traversed, and use it to calibrate the rate at which the 
other ellipses are traversed. Thus the relative rate at which planets orbit the sun remains 
a meaningful concept. This amounts to using the motion of one planet as an internal clock 
with respect to which the motions of the other planets are measured. However, because we 
are interpreting this as a whole universe model, by definition there exists no clock external 
to the system with respect to which the motion of the planet-clock itself can be calibrated. 
So it would make no difference to the study of the system if another internal observer chose 
to study the system by allotting a different rate to the planet-clock. Note also how such 
use of a part of the motion as an internal clock is of limited accuracy. Planet-planet pertur- 
bations are not negligible, so one must consider increasingly more extended configurations 
of planets in the quest for accuracy, until all the planets are included. The only arbitrarily 
precise clock in the universe is the universe itself. 

The meaninglessness of performing translations and rotations on the whole n-particle 
universe is captured by working not on the (3n)-d configuration space Q(n) of the q,., but 

rather on the (3n - 6)-d (n > 3) quotient space 

Q(n) 

n-particle Relative Configuration Space RCS(n) = — — - . (294) 

The BB77 model [SH] is an attempt to work directly on this space, using the following 
implementations. 

RI[R2] A Jacobi- type action is considered (to have manifest RI), and 
Direct[Rl] the action is built out of Ibb77 = J dXV-Vf , 

y = _ y T= y "'(^"'(i) ( "^nrn V .295) 

This is in fact an often-rediscovered theory, first found by Reissner |31Uj and then by 
Schrodinger j327j . albeit BB77 obtained it on slightly different premises (spatiotemporal 
rather than purely spatial relationalism) . This theory is interesting e.g Schrodinger showed 
how it explains the perihelion precession of Mercury as well as GR does, and Barbour- 
Bertotti showed that it yields a new cosmology. And it relates local inertia to distant 
masses; however mass anisotropy arises thus, which is experimentally unacceptable to very 
high precision (5 parts in 10^^) |19L)llll4j . 

In the BB82 model, they use 
BM[R1] work indirectly on the RCS by considering the correction of the 'bare' velocities 

%) ^k,Qq(,) = q(^.) - k - X q^ , (296) 



where the k and are auxiliary variables associated with the action on q^^^ of the generators 
of the translations and the rotations. This corresponds to keeping one configuration fixed 



and shuffling a second other one around by means of translations and rotations as a means of 
casting the second configuration into as similar a form as possible to the first configuration. 
This procedure is called best matching (BM). The corrections are Lie derivatives 
corresponding to draggings in the unphysical directions. Thus indeed no additional structure 
is used in this construction. 

They furthermore use the implementation RI[R2], thus considering actions 

(n) 

Ibb82 = 2 / dA^(E-V(q(^p)T , T = ^ ^^^.f^q^) • fik,fiq(^.) • (297) 

(i)=(i) 



The particle momenta are p*-*^ = — = \ 

5q... V T ' - 



(298) 

Now, the presence of a square root means that the Lagrangian is homogeneous of degree 1 
in the velocities. It is this homogeneity property that guarantees the RI and furthermore 
implies that the theory must satisfy at least one primary constraint as an identity by the 
following argument of Dirac's |112j . The canonical momenta must be homogeneous of degree 
0. Thus they are functions of ratios of velocities alone, but there are only — 1 of these, so 
that there is at least one relation between the n momenta. I now show how this occurs in 
the above type of action: 



(i)=(i) ■■ (i)=(i) 



E-V ^ E-V 

= m'(i)6k,Qq(i) • fik,f7q(j) = ^j^2T = 2(E - V) . 

Thus the square root form gives rise to one primary constraint by "Pythagoras' theorem" : 

^ D • D 

E 9 -(E-V) = 0. (299) 

Varying with respect to the auxiliaries k and respectively, one obtains that the total 
momentum and angular momentum of the whole n-particle universe must be zero, 

(n) (n) 

(i)=(i) (i)=(i) 



The particle ELE's are fi^^pW = W (301) 

so, by coupling (|298() and (|3U1|) . if one picks the unique distinguished choice of label time 
such that VT = \A7, one recovers Newton's second Law (|83|) . This choice corresponds to 
the total energy of the universe also being zero. So both the BM[R1] and RI[R2] imple- 
mentations lead to constraints. These happen to be preserved by the evolution equations 
provided that the potential is a function of the relative separations alone. 

I consider it useful to view the above in other ways. I show here that the above auxiliaries 
are in fact not Lagrange multipliers but velocities associated with cyclic coordinates. This 



necessitates a more careful account of how the variation is to be done. It also means then 
the kinetic term truly remains homogeneous of degree 2 in its velocities upon Eucl-BM, 
thus indeed not compromising the RI of the action. 

I start with the implementation AF[R1]: I write down my actions in the appropriate 
arbitrary frame. This in my opinion ought to underly BM: the BM auxiliaries are velocities 
associated with the transformation between the stacked and arbitrary frames. Variation 
with respect to these auxiliaries then ensures that this choice of frame does not affect the 
physics (this is again an indirect implementation of Rl). 

For particle mechanics, writing down the action in an arbitrary frame requires use of 

'corrected coordinates' with respect to Eucl, q'^^^ = =-^1,© ) 

(302) 

for e'^ the frame basis and the components of the {i)th particle with respect to it, 
and for Ei^q the coordinate transformations of Eucl and where the arrow indicates group 
action (which happens to be on the underline since the action on the basis c index and the 
component c-index are opposite and cancel out). 

The potential should then be built to be Eucl-invariant from the start. Dealing with 
the kinetic term is trickier since ^ is not even Eucl-covariant, due to the involvement of 

the frame itself: ^ = j +^%) = \dX ) ^ ^'.^ • 

(303) 

Thus use of an arbitrary frame in building the action amounts to passing from the bare 

This amounts to bringing in Eucl-generators which infinitesimally drag in all unphysical 
directions (corresponding here to the translationary and rotationary status of the whole 
universe) . 

That one is treating each auxiliary not as a Lagrange multiplier q\ but as velocity 
qc corresponding to a cyclic coordinate qc is a justifiable choice, by use of the following 
equivalent free endpoint variation with respect to qc- Starting from the standard expression 

= 6S 



stacked velocity to the unstacked, BM velocity I I = fik.nqp-, for k = I and Q = Q. 




ei, £2 

where the first step uses since qc is cyclic, and noting that 5qc no longer vanishes at 
the endpoints ei and 62, and the second step uses the endpoint condition to fix the value 
of the constant pc- 

I also note that RI Lagrangian Eucl-BM correction corresponds precisely to the Hamil- 
tonian Dirac-appending of constraints according to 

H = T+v+k . y: p« +0-2: q(,) X p« = y: p^^^ • ■ e p^^^-^ • e %) - p^^^ 

(i) ii) (z) (z) (i) 



Y.(%)-^-Q-^(i) X P«)'-T-V = T(6k,Qq(,))-V(q(^p ^ U = 2^T(fik,nq(,))V(q^^p 

(304) 



'^I have since seen that Lynden-BeU also thought along these lines |261| . 



(presented in the T homogeneous quadratic in the (/(j) case) , where the first three steps form 
a Legendre transform and the fourth step is the passage to the Jacobi form. 



2.1.2 General best-matched reparameterization-invariant schemes 

Consider a collection Q of objects q/\ which are functions of position on some manifold and 

of label time A. Suppose there is a continuous group G of small motions that have been 

declared to be irrelevant to the physics of the objects. This could include both the isometry 

(or kinematic) group of space and internal symmetries of the objects themselves. Then the 

extension of the above Machian scheme is to demand 

general Rl that the action be invariant under G-transformations, and 

R2 be unaffected by any overall choice of time for the whole universe. 

The first of these is implemented as follows. 
BM [general Rl] would involve working not on the configuration space Q(A) but rather 

the quotient space RCS(A) = ■ 

(305) 

by the indirect means of correcting the bare velocities with auxiliaries corresponding to the 
freedom of moving in the G-directions. Thus one passes to the 'doubly degenerate' Q(A) x G 
but then the constraints corresponding to variation with respect to the auxiliaries ensures 
that G is physically irrelevant. 

I take this implementation to be built on an underlying implementation 
AF [general Rl] : the action is to be written in an arbitrary G-frame and so in terms of 

the 'corrected coordinate' = ^'a^'pA laA 

(306) 

where greeks are manifold-multi-indices, Greeks are internal or field-type multi-indices, Ca 
is the obvious product of frame bases with respect to which the A-th object has components 
QaAj Ei'iid Laj.j arc the coordinate transformations of G corresponding to generators associated 
with the auxiliary variables ajjj , From this it follows that the auxiliaries involved are G-frame 
velocities. 

The second is implemented as 
RI[R2] : the use of manifestly Rl actions (of the usual Jacobi type or some suitable 
generalization homogeneously linear in the velocities). 

According to AF [general Rl], the potential should then be built to be G-invariant, to 
which end it is useful if a collection of G-invariant or G-covariant objects are known. Such 
G -concomitants are available for all the examples in this chapter (but not in III. 2). The 
kinetic term is trickier since ^ is not even G-covariant due to the involvement of the frame 



[i]e generators of G(A) P] 

(307) 

Thus use of an arbitrary G-frame in building the action amounts to passing from the bare 
stacked velocity to the unstacked, BM velocity (^^j = ^a.[i\QA Thus one has brought in 
generators which infinitesimally drag in all unphysical directions. This corresponds to keep- 
ing one configuration fixed and shuffling a second one around by means of the physically- 
irrelevant symmetries of the collection of objects as a means of casting the second configu- 
ration into as similar a form as possible to the first configuration. 



Variation with respect to each ajj] gives one secondary constraint. For homogeneous 
quadratic actions, these are hnear in the momenta. If these are satisfied, one has successfully 
passed from Q(A) x G to By Dirac's argument there still remains at least 1 primary 

constraint due to R2. There may also be additional secondary constraints from applying 
the Dirac procedure to the constraints. 

This procedure may be interpreted as in the previous section with the aj;] as auxiliary 
velocities corresponding to cyclic coordinates, T indeed remains RI. The interconversion 
with Hamiltonian Dirac-appending continues to hold, at least while the appended terms 
are linear in the appending Lagrange multipliers. 

Whereas mechanics is not usually considered in the above RCS fashion (which recon- 
ciles Newtonian mechanics with Leibniz's ideas at a small global cost), it so happens that 
conventional electromagnetism and GR can be thought of as working along these lines. I 
discuss these in greater depth in App ILB (generalized to the Yang-Mills case) and in IL2.2 
respectively. 

2.2 Relativity without relativity 

2.2.1 Local square roots 

The configuration space is 

Q(GR) = Riem = {set of 3- metrics on a fixed topology, taken to be CWB} 



The RCS is RCS(gr) C {Superspace} - {Riem} 



{3-Diffeomorphisms} 

(308) 

though this is only part of the way to wards isolating a representation of the true dynamical 
d.o.f's of GR (the Hamiltonian constraint remains and is problematic). 

This is to be implemented indirectly by BM with respect to the 3-diffeomorphisms: 

hij — > fi^/ijj = hij — £^hij = hij — 2L)(j^j) . (309) 

So for any two 3-metrics on 3-geometries Si, T,2, this corresponds to keeping the coordinates 
of Si fixed whilst shuffling around those of S2 until they are as 'close' as possible to those 
of Si. 

The new ingredient in the RWR paper is to implement the lack of external time by using 
a RI action with a local square root ordering. For particle mechanics, this would mean using 



E(i)=(i) ^^{i)9:(i)%) rather than the global square root ordering y E(i)=(i) ^{i)^(i) ' %)■ 
If one has a field theory in place of a finite number of particles, the sums are replaced by 
integrals. For gravitation, BFO thus consider BSW-type actions, taken to be of form 

iBSW-type = j^^j dx'^'/hVPVr , (310) 

as opposed to BB82's global square root actions. Although the square root is local in that 
it sits inside the integral over all space, there is also a 'little sum' over the indices hidden 
within the kinetic term T in (|310|) . 

The global analogue of GR (the original BB82 geometrodynamics), has the action 




iBSW-type = / I d'xVhJ^J I d'xVhR , (311) 



^^FoUowing |233| . mention of BSW was included in BB82 but such local square roots were not studied 
until BFO's paper. 



whose global square root then gives one constraint only rather than the one constraint per 
space point of GR. Note that this global alternative would arise from the ADM Lagrangian 
by the BSW procedure were one to assume a is independent of Xj. 

Note that the suggestion in II. 2. 1.1 about treating auxiliaries may be applied in GR 
to to have an action that is homogeneous of degree 1 in the velocities and hence repa- 
rameterization invariant, we can treat it as an auxiliary velocity to which free end-point 
variation is applied. 

One of Barbour's conceptual ideas behind the TSA is that such RI actions could be con- 
sidered as geodesic principles on configuration space: the reduction of the physical problem 
of motion to the geometrical problem of finding the geodesies of the configuration space 
geometry. This followed from the insight of homogeneous-quadratic Newtonian mechanics, 
for which use of Jacobi's principle reduces the problem of motion to a problem (conformal 
to) the well-defined, well-studied problem of finding the geodesies of the Riemannian config- 
uration space geometry. But I dash his original hope that something similar would happen 
in GR in VI. 1. 

The BSW formulation of GR and the Jacobi formulation of mechanics are similar in 
that in each case the presence of a square root means that the Lagrangian is homogeneous 
of degree 1 in the velocities. Thus primary constraints exist by Dirac's argument; specif- 
ically here the square roots give rise to primary constraints by 'some version or other of 
Pythagoras'. 

The standard (global square root) Jacobi case is just the usual bare version of the 
previous section. The global square root gives 1 constraint Vq in total and then ELE's 
imply that Vq ~ 0. Also for the choice of A such that T = E - V holds, the bare version of 
(|l^()l|) reduces to the form of Newton's Second Law. This special A has the same properties 
as Newton's absolute time. If there is no external time, then E = T -|- V is to be interpreted 
not as energy conservation but as an emergent definition of Newtonian time. 

In contrast, local square roots give many primary constraints: one per object summed 
over (1 per space point in the continuum case). Typically applying the Dirac procedure 
to many primary constraints will give many more secondary constraints which use up all 
the d.o.f's. This overconstraining due to the local square root led BFO to call it the 'bad' 
ordering and the global square root the 'good' ordering. 

The local square root homogeneous quadratic mechanics in which each particle has its 



own potential would be S'Mech = 2 / dA ^ ^J (E(j) - , 

W={i) 

(312) 



(i) p • p 

which gives one constraint = = (E/j-i— V/n) = 

2m(j) 

(313) 

per particle, which then gives a tower of constraints rendering the theory inconsistent by 
using up all the d.o.f's. A simple special case avoiding this can however be found by 
inspection of the primary constraint (|313j) : the noninteracting case in which each particle 
moves in its own potential alone. 

In the case of the BSW Lagrangian one obtains an infinity of primary constraints: 1 per 
space point. As we show below however, despite GR having the bad ordering, it turns out 
not to be overconstrained. 

The BSW action may be written as Ibsw ~ J J 'i^2;\//iV'^VT^ , 

(314) 



where the gravitational kinetic term is given by = —=G°'^'^'^%^hab^^hcd , 

(315) 

for G^^""^ = Vhih^''h'"^-h''^h'"^) 

(316) 

the inverse of the DeWitt supermetric, and where I am now regarding it as an action 
constructed to meet appropriate relational demands general Rl and R2, as emphasized 
by making use of ^* for the relational auxiliary, and for the emergent lapse to distinguish 
these from the presupposed spacetime notions of shift and lapse, /3* and a. I will also regard 
^* as some where are the coordinates and I am building an x'-invariant action; care 

is needed particularly with - : -/.^ = " ^D^aCt) J > = • 

(317) 



The canonical momenta (defined at each space point) are p*-' = = y ^^G*-''^'^fi^/icd 

(318) 

R 



for 2N = a/-^- At each space point 



2N ^ 2N' (2A^)2 



so the local square root gives one primary constraint 7i = Gijkip^^p^^ — VhR = . 

(320) 

In addition, ^*-variation (treated as a Lagrange multiplier) of the BSW action leads to the 
momentum constraint 7ii = —2Djp'^ = as a secondary constraint. It is unsurprising that 
we get this since in starting with a 3-space ontology we are allowed to provide the BM 
correction with respect to 3-diffeomorphisms. Thus we get the momentum constraint quite 
on purpose. Note also that H32U|) may be identified as the Hamiltonian constraint of GR, 
Ti.. Whereas the action (|314)) is associated with curves on the space Riem x H, where H is 
the space of the if the momentum constraint can be solved as a p.d.e for (the thin 
sandwich conjecture of 1.2.9.1), the action will depend only on the curve in Superspace. This 
follows from the constraints being free of and the three components of the momentum 
constraint reducing the number of d.o.f 's from the 6 of Riem to the 3 per space point in a 
3-geometry. 

The ELE's are (EHI) and = 4^ = aVhNih'^R - R'^) - ^ (p'"'pj - -p'^p] 

ohij Vh \ 2 / 

+ aVh{DWm - h'W^N) + £^p'^ . (321) 

This is indeed GR, for which it is well-known by the contracted Bianchi identity (|29|) that 
both constraints propagate without further secondary constraints arising. Furthermore, 
in this thesis, the momentum constraints are automatically propagated as a further con- 
sequence of the action being deliberately constructed to be invariant under X-dependent 
3-dijfeomorphisms. 



In the present case this momentum constraint propagation takes the form 



ni = -^D^ (N^n) + £^ni . (322) 

The Hamiltonian constraint propagation takes the form 

1 Nt) 

n = -—D^ [N'^T-ij] + -^n + £^n . (323) 

(c.f 1.2.6, II. 1). Thus, unHke in mechanics, this system is not over-constrained. From the 
3 + 1 perspective, the reason that GR is not over-constrained is that it possesses a hidden 
foliation invariance. Whereas at first glance, one would expect the BSW action to be 
invariant only with respect to the global reparameterization A — > A' (A) for A' a monotonic 
arbitrary function of A (in accord with Noether's theorem), in fact the action is invariant 
under the far more general local transformation 

fix' 

A A'(A), hi.ix, A) hi.ix, A'), Ux, A) — e*(x, A) . (324) 

This is the foliation invariance of the 3 + 1 split of GR - the hidden symmetry associated 
with the Hamiltonian constraint 7i. In contrast mechanics has just the global A — > A' (A) 
invariance unless the aforementioned restriction is applied. 

2.2.2 Uniqueness of consistent BSW-type actions 

We know [by virtue of the contracted Bianchi identity (|29jl ] that GR will work as a constraint 
algebra. The question then is does anything else work? 

BFO's idea was to postulate 
BM [general Rl] : the best matching rule is used to implement the 3-d diffeomorphism 
invariance by correcting the bare metric velocities hij — > 6g/ijj = hij — £^hij. This rule 
will be applied in IV to the velocities corresponding to all the matter fields "I^a as well: 
^'a — > = ~ £(^A so it is a universal rule. For everything in this chapter, this 

can be taken to arise from the corresponding AF [general Rl]. 

RI[R2]: a local square root reparameterization-invariant implementation is used. The 
pure gravity actions considered are of Baierlein-Sharp-Wheeler (BSW) type with T homo- 
geneously quadratic in the velocities and ultralocal in the metric. 

My analysis below however differs from BFO's since they missed out a number of possibilities 
(as explained below). My trial BSW-type action is 

iBSW-type = / dA y d'xVhVaR + A^/t^ , T^y = ■^Gt"'^habQh,d , (325) 
where G%^^ = Vh{h'''h^^ - WW^h^^) , W , 

(326) 

is the inverse of the most general (invertible) ultralocal supermetric (|215j) . and without loss 
of generality a G {—1,0, 1}. More general potentials are discussed in II. 2. 2. 5. 

., ,. , . jj - _ VhY ^ij^d. 



Setting 2N = \J , the gravitational momenta are p-' = — ■. — = ^^cd ■ 

(327) 



The primary constraint 7i = —j= (pop \ — \fh{aR + A) = 



(328) 



then follows merely from the local square-root form of the Lagrangian. In addition, 
variation leads to a secondary constraint which is the usual momentum constraint ()84|). 



The ELE's are = ^ = VhNh'^{aR + A) - VhaNR'^ - (p'^^pj - ^p'^p] 

ohij \/h \ 2 / 

+ Vha{D'Dm - h'W^N) + £^p'^ . (329) 
The propagation of TC then gives [Sj 

n = ^D\N^m) + + £^ + 2 (1 _ x)YaDi (N^D'p) . (330) 

The first three terms of this are related to existing constraints and thus vanish weakly in the 
sense of Dirac. However note that the last term is not related to the existing constraints. 

It has 4 factors which could conceivably be zero: 

(331) 

Any of the first three factors being zero would be strong equations restricting the form 
of the ansatz. The fourth factor might however lead to new constraints and thus vanish 
weakly. 



2.2.3 Interpretation of the consistency condition 



The question posed in RWR - and answered in the affirmative - is whether GR can be 
derived solely from 3-d arguments, that is, without any use of arguments involving 4-d 
general covariance (spacetime structure). The approach succeeds because of the need to 
propagate Ti. acts as a powerful filter of viable theories, which are already strongly restricted 
by Tli being tied to 3-diffeomorphism invariance. This means that one has only 2 d.o.f 's per 
space point to play with. The remarkable invariance at the end of 1.2.2.1 does not usually 
hold for the generalization (|325|) of the BSW action. 

We require the term H331|) to vanish in order to have a consistent theory. Unless one 
wishes to have a theory with a privileged slicing, constraints must be independent of N. In 
this case the fourth factor requires Dip = ^ = C(A). But this new constraint must 
also propagate. This leads to a nontrivial lapse-fixing equation (LFE) which (if soluble) 
gives a constant mean curvature (CMC) foliation. The LFE is 

1^ (_^) = BiX) = SAN + 2aiNR - D^N) + "^f , (332) 

where B{X) is a spatial constant. Note that for cr 7^ this is a nontrivial equation for the 
lapse N. It is the standard CMC LFE (0H7|) in the GR case {a = Y = W = 1). 

So if neither cj = nor y = 0, the DeWitt {W = X = \) supermetric of relativity is 
enforced, which is BFO's 'Relativity Without Relativity' result. Note that the Lorentzian 
signature {a = 1) of GR does not drop out of this working; one can just as well obtain 
Euclidean GR (cj = —1) in this way. Earlier work of Giulini already noted that the W = 1 
supermetric is mathematically special |156j . 

But there are alternatives arising from the second, third and fourth factors. The study 
of any new alternatives arising this way is motivated by how these arise from an exhaustive 
route to GR. Can we overrule them and thus arrive uniquely at GR, or do serious alternative 
theories arise? In addition, these alternative theories have interesting properties which we 
use to motivate their study in their own right. These motivations are presented at the start 
of the treatment of each theory in III and V. 

The £7 = branch works for any W. This is a generalization of strong gravity |2()()| 
nW)[ EHl UnZl UMl imi which is strong-coupled limit of GR in the W = I case. The other 



theories are not related to GR, but are related to scalar-tensor theories; strong gravity 
theories are argued to be relevant near singularities (see III.l for all of this). The HKT 
program would discard the strong gravity theories since they are not a representation of 
the Dirac Algebra (although Kuchaf |235j and Teitelboim j352j did elsewhere study strong 
gravity). However, the strong gravity theories meet the TSA's immediate criteria in being 
dynamically consistent theories of 3-geometries. 

The fourth factor of I^Ti} leads to conformal theories gOl CIl EIH EEl [12]. These 
theories now have privileged slicings, which happen to correspond to those most commonly 
used in the GR IVP: maximal and CMC slices. 

For example, our conformal gravity j4Ul lllj has the action 



dA 



+ 



' c 



vi<p) 



volume V 



-r 



abed 
'{W =0) 



ab 



d^xVhcj)^ (333) 



-h 



cd 



(334) 



which is consistent for a = 1 because it circumvents the above argument about the fourth 
factor by independently guaranteeing a new slicing equation for the lapse. Despite its lack 
of 4-d general covariance, conformal gravity is very similar to CWB GR in that CWB GR's 
dynamical d.o.f 's may be taken to be represented by [3891 139UI I391j 



CS + V = Conformal Superspace + Volume 



Diff X { Volume-preserving conformal transformations} 
and conformal gravity arises by considering instead 

Conformal Superspace = -— t-- — - — ^ . (336) 

Uitt X {Conformal transformations) 

This has an infinite number of 'shape' d.o.f's whereas there is only one volume d.o.f. Yet 
removing this single d.o.f changes one's usual concept of cosmology (see V.2), and ought 
to change the Problems associated with the quantization of the theory (by permitting the 
use of a positive-definite inner product and a new interpretation for Ti) [nj. One arrives 
at further 3-space CS+ V theories if one chooses to work on (|335|1 instead of (|336|) [TTl [T^ 
whilst still retaining a fundamental slicing. There are yet other conformal theories (III. 2. 6, 
V.2.2). 

I included the Y, rather than scaling it to 1 like BFO, to make clear my recent insight 
into how a 'Galilean' branch arises for y = i.e when the gravitational momenta completely 
vanish in the Hamiltonian constraint. From this as well as the Lorentzian branch arising, 
it becomes clear that the condition that (|331j) vanishes is closely related to the choice of 
postulates that Einstein faced in setting up special and general relativity. This point is best 
further developed in V once matter has been introduced. 

To mathematically distinguish GR from these other theories, I use 
TSA GR postulate 3 : the theory does not admit privileged foliations and has signature 
e = -a = -1. 

My future strategy will involve seeking to overrule the conformal alternatives on fundamen- 
tal grounds, by thought experiments or by use of current astronomical data, which would 
tighten the uniqueness of GR as a viable 3-space theory on physical grounds. If such at- 
tempts persistently fail, these theories will become established as serious alternatives to GR 
(see V.2). 



2.2.4 Deducing BM from BSW alone 



O Murchadha j282j and I [31 separately considered the possibility of starting off with 'bare 
velocities' hij rather than BM ones hij — £^hij, to provide a variation on the 3-space theme 
in which the spatial relationalism is emergent rather than assumed. My generalized version 
(including o", Y , W and A) works as follows. I obtain a bare momentum 

and the usual Hamiltonian-type constraint H328|) arises as a primary constraint from the 
local square root. The temporary ELE's read 

= ^ = VhNh'\aR + A) - VhaNR'^ - (p'^^pj - ^p'^p] 

oliij \/h \ ^ J 

+Vha{D'Dm - h'W^N) , (338) 
which permit the evaluation of the propagation of 7i 

n = -^Da{N'[D,p'^'' + iX- 1)D>]) + . (339) 

For neither a = nor Y = 0, 1 then obtain the secondary constraint 

Sa = Dbp\ + {X- l)DaP = . (340) 

But propagating this gives 

4 = V-H(X - 1) («„ [2.(NR - D^N) + 3A.A + _ " f'^ O.p) 

-^D^N^-H) , (341) 

SO if constraints alone arise (rather than conditions on N imposing preferred foliations), I 
require X = 1 and recover relativity since the secondary constraint —2Si = becomes the 
momentum constraints^ 

Hi = -2DjPi^ = 0. (342) 

One then argues that this 'discovered' constraint may be 'encoded' into the bare action 
by the introduction of an auxiliary variable It is then this encoding that may be thought 
of as the content of BM. This is similar to how the emergent gauge-theoretic Gauss con- 
straints are treated in IV. 1. One then re-evaluates the momenta and ELE's, obtaining the 
GR ones ()318p and 1)321(1 in place of the above temporary bare ones ()337() and (|338() . 

For o" = or y = no additional constraints arise. Thus it is possible to 'miss out' 
rather than 'discover' constraints by this integrability method above (further discussed in 
III. 1.2). Thus although the 'bare' and BM schemes are equivalent for pure GR j282j . they 
are not in general equivalent 0. Another example of this is 'bare Barbour-Bertotti theory' 
in which Ai = C = are not recovered, and hence is just Newtonian mechanics. 

2.2.5 Higher derivative potentials 

Using the potential P = aR + A assumed in 1.2.3 amounts to applying a temporary 



^That Hi is an integrability of Ti. was already known to Teitelboim and Moncrief |275l 134^ . 



TSA gravity simplicity 4: the pure gravity action is constructed with at most second- 
order derivatives in the potential, and with a homogeneously quadratic BM kinetic term. 
Furthermore, BFO considered potentials that are more complicated scalar concomitants of 
the 3-metric hij than the above: P= and P= CiR"^ + C2R o R + C^D'^R (the most 
general fourth-order curvature correction in 3-d because of the Gauss-Bonnet theorem). 
Among these the potential of GR alone permits the Hamiltonian constraint to propagate. 
Also, recently, O Murchadha j283j considered actions based on matrices Mab{xi,X) and 
their conjugates P"'^{xi,\) which contain all terms in the former with the use of up to 
two derivatives and are ultralocal in the latter. In this bare approach, he recovers the 
combination R{Mab) for the form of the potential (along with the strong and conformal 
options), as singled out by the propagation of the local square root constraint. 

The first difficulty with the above methods is that they are not as yet systematic order- 
by-order, whereas HKT were able to use induction to dismiss all alternative potentials to all 
orders. Furthermore, HKT's proofs rest on the extra structure assumed, so similar proofs 
cannot be easily envisaged for the TSA. The second difficulty is that other known theories 
are in fact implicitly excluded both in BFO's and in HKT's work (further discussed in VI. 1.5 
along with other difficulties). Finally, HKT's result is known to be dimension-dependent 
since Lovelock gravity also reproduces the Dirac Algebra |353j . 

2.2.6 The inclusion of matter 

Just as Teitelboim |3481 1351j 'added on' matter so as to strengthen HKT's answer (see 
App IV. C), we have strengthened BFO's answer so. Our first works j38| llOj appear to give 
some striking derivations of the classical laws of bosonic physics. Rather than being presup- 
posed, both the null cone structure shared between gravitation and classical bosonic matter 
theories, and gauge theory, are enforced and share a common origin in the propagation 
of the Hamiltonian constraint. Thus electromagnetism and Yang-Mills theory are picked 
out. Gauge theory arises in this picture because one discovers the gauge-theoretic Gauss 
constraint as a secondary constraint from the propagation of the Hamiltonian constraint. 
This is then encoded by an auxiliary field which occurs as gauge-theoretic BM corrections 
to the velocities. 

The matter considered was subject to the 
TSA matter simplicity 5: the matter potential has at most first-order derivatives and 
the kinetic term is ultralocal and homogeneous quadratic in the velocities. Apart from the 
homogeneity, this parallels Teitelboim's main matter assumptions. 

As explained in VI. 2-3 there is a further tacit simplicity hidden in the 'adding on' of matter. 
This is linked to how including general matter can alter the gravitational part of the theory. 
This issue is tied both to the relationship between the TSA and the POE (see VII. 2), and 
in my contesting BFO's speculation that the matter results "hint at partial unification" 
(see VI, VII). 

3 Spacetime or space? 

Having built up a theoretical 3-space scheme working in turn with point particles, gauge 
fields and 3-geometries, I now argue for its merit and compare it with Einstein's route, 
ADM's formulation and HKT's route. 

I begin by elaborating on Einstein's motivations for his route to GR and then explain 
how his route neither directly implements these nor is in accord with the dynamics-centred 
development of the rest of physics. The first of these points is a relationalist's complaint 
for which RWR offers a peaceful reconciliation: the direct implementation also gives rise to 
GR. The second point has become quite a tension, as explained below. 

Einstein had several goals |128j during the years that he created special and general 
relativity. The first, realized in special relativity, was to reconcile electromagnetism with 



the universal validity of the restricted relativity principle RPl. In contrast to Lorentz 
j257j . who explicitly sought a constructive theory jl24j to explain the Michelson-Morley 
experiment and the relativity principle, Einstein was convinced that the quantum effects 
discovered by Planck invalidated such an approach |128j . p. 45. He "despaired of the 
possibility of discovering the true laws by means of constructive efforts" |128| . p. 53, 
and instead adopted RPl as an axiomatic principle jl24j . Notably this reconciliation 
involves adopting the physically-motivated Lorentzian RP2 in place of the absolute-time- 
motivated Galilean RP2. But whereas one has a new set of privileged frames (SR inertial 
rather than Newton inertial), 

1) one is still giving distinct status to configurations which are relationally identical; 

2) there is still distinct unaccounted-for status being given to a privileged class of frames. 
Einstein's further goals were to free physics of absolute space ('implement Mach's principle', 
see below), and to construct a field theory of gravitation analogous to Maxwell's electro- 
magnetism. Encouraged by his treatment of RPl as a principle to be adopted rather than a 
result to be derived, Einstein generalized it to the general relativity principle GRP, accord- 
ing to which the laws of nature must take an identical form in all frames of reference. The 
GRP was eventually implemented as the 4-d general covariance of a pseudo-Riemannian 
dynamical spacetime. 

Whereas Einstein's route addresses 2) via the equivalence principle and the local van- 
ishing of the connection which is in turn influenced by dynamical geometry that is itself 
influenceable, 1) is apparently disregarded. Also, in making spacetime the arena of dy- 
namics, Einstein broke radically with the historical development of dynamics, in which the 
configuration space and phase space had come to play ever more dominant roles. Both of 
these played decisive roles in the discovery of quantum mechanics, especially the symplectic 
invariance of Hamiltonian dynamics on phase space. Since then, Hamiltonian dynamics has 
also played a vital role in the emergence of modern gauge theory |112j . In fact, spacetime 
and the canonical dynamical approach have now coexisted for almost a century, often cre- 
atively but also not without tension. This tension became especially acute when Dirac and 
Arnowitt, Deser and Misner (ADM) |112| I19j reformulated the Einstein field equations as 
the constrained Hamiltonian dynamical system 1)831 1841 11881 1189() describing the evolution 
of Riemannian 3-metrics hij Dirac was so impressed by the simplicity of the Hamiltonian 
formulation that he questioned the status of spacetime, remarking "I am inclined to believe 
. . . that 4-d symmetry is not a fundamental property of the physical world" jllUj . Wheeler 
too was struck by this development j376j . and contributed the thoughts and terminology of 
1.2.7. 

However, the hitherto unresolved status of the one remaining redundancy due to the 
Hamiltonian constraint, which can only be eliminated at the price of breaking the spacetime 
covariance of GR, is probably the reason why neither Dirac nor Wheeler subsequently 
made any serious attempt to free themselves of spacetime. In particular. Wheeler rather 
formulated the idea of embeddability, i.e, that Riemmanian 3-geometries always evolve in 
such a way that they can be embedded in a 4-d pseudo-Riemannian spacetime |376| . which 
led Hojman, Kuchaf and Teitelboim J186' to their seventh route to general relativity. 

Contrast this, and Einstein's indirect approach through generalization of RPl to GRP, 
with the TSA first principles, which constructively implement Mach's idea of a relational 
dynamics! Whereas Einstein tried to incorporate his interpretation^'^ of Mach's principle 
into GR, that interpretation and GR were found not to be satisfactorily compatible. But 
CWB GR is Machian in Barbour's (or Wheeler's) sense, in addition to being a spacetime 
theory. 

The primality of spacetime or space constitute two different ontological viewpoints. In 
presupposing embeddability, HKT assume the habitual spacetime viewpoint of relativists, 
whereas Barbour defends the space viewpoint. In the former, there are two presuppositions 

^''Mach's principle was left open to diverse and misunderstood interpretations |29l I41| . 



about how 3-geometries stack: spatial stacking and foliation invariance. These presupposi- 
tions become manifest in Kuchaf 's general decompositions of spacetime objects with respect 
to spatial hypersurfaces (see VI. 2). In the latter, spatial stacking and RI are presupposed. 
One then sees that there are alternatives to the spacetime foliation invariance presuppo- 
sition in the form of the alternative theories of III (even if these are distasteful), and it 
becomes interesting to see if these can realistically describe the universe. Not all Machian 
theories are spacetime theories! 

Supportive arguments for BM theories have recently been put forward by Brown and 
Pooley |3nij for mechanics and by Pooley |3nnj for relativity. An argument against it (in 
mechanics) is that relationalists have it easy when C = jll8j . But C = may be argued 
to be a prediction of BB82 theory: a restricted subset of Newtonian mechanics emerges, so 
the BB82 theory is good in the sense of being more restrictive and thus in principle more 
readily falsifiable than Newtonian mechanics. I emphasize that within geometrodynamics 
the debate ends up being "spacetime versus space" and not the full "physical objects plus 
container versus objects alone" of the absolutist versus relationalist debate. Reality is 
ascribed to structure, but this is to the 3-geometry and not to space(time) points. 

Before the discussion of spacetime versus space, I require the introduction of matter 
(IV, VI). I then show how the space scheme works in V.l (including the emergence of the 
Relativity Principle), and how split space-time works in VI. 2. These are contrasted in VI. 3. 
The status of the POE in these schemes is explored in VII. 2. 

The two viewpoints are suggestive of how one approaches quantum gravity and its 
Problem of Time (in VIII). Internal time is based on belief in spacetime (as are formulations 
in terms of sum over histories, generally-covariant histories, spin foams and causal sets), 
whereas not specifically (spacetime) generally covariant canonical approaches and Barbour's 
hope in the naive Schrodinger interpretation are space-based. The question is then: was 
spacetime a historical accident in that, because of being a very fruitful approach in classical 
GR and in quantum particle physics in a Minkowskian background, we have been overly 
bedazzled by spacetime and consequently misled by it in our attempts to quantize gravity? 

Barbour also put forward his classical and quantum ideas in [201 1^ and the popu- 
lar book These have been critically discussed by Butterfield (TDJ and Smolin |338j 
largely from a quantum mechanical, philosophical perspective. In contrast, my criticisms 
in V, VI and VII concentrate on the later technical papers, from a classical, mathematical 
perspective. 

App II. A : Deriving the relativity without relativity result 

For the arbitrary ultralocal supermetric, one can build up the following ^*-free parts of 
results: 

hij = —j=- i^Pij - —phijj , (343) 
h'^ = -'^^ ( p'^ - —phA , (344) 
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VhR = -2NY i^p'^ - yP^'j + '^YDjDiNp'^ + 2[X - l)YD'^Np 

+^D,iN''D,p'^) + '^^^~^^^ D\N^Dip) . (348) 

These (along with the metric ELE) can be composed by the chain-rule to form the prop- 
agation of the Hamiltonian constraint, and hence weakly obtain the consistency condition 
13311 fwhich includes the RWR result), from the last term of H348() . One can similarly obtain 
the working for the propagation of the momentum constraint. 

The part of the Hamiltonian constraint propagation is simple once one realizes that 
£^R is simply ^^diR [which could also be derived longhand using the symmetry of Rab, 
the Ricci lemma (jllj) and the contracted Bianchi identity H29|) ]. By similar techniques, the 
.^*-part of the propagation of the momentum constraint is 

D,i£^p'^)+ti.\^pi'^£^iDjp'^) = UDjD'f^ - D^D.p'^) +pi^DjDke - 0^0^^) 

+ pi\DjD'^k-DWj^k) 

= URkidP"' + Rkj\p'^) + p'^Rkj^i" + P^'R'jkd^" = . 

(349) 

App II. B 3-space approach and gauge theory 

I go beyond BB82's treatment in including an arbitrary gauge group G rather than U(l), 
and in considering a local as well as a global ordering. Given a configuration space that 
is a collection of 1-forms (where I running over 1 to K are internal indices) which are 
functions of position on and of label time A, and which are invariant under small 



G-transformations, I require general Rl: to work on the RCS(74^) — 



G X Eucl 

(350) 

N.B I could also (more standardly) proceed without involving Eucl. 
R2 The choice of time label is not to affect the system. 

I use the indirect implementation BM [general Rl] by passing to Q{A^) X Gx Eucl 
by best matching each velocity. The electromagnetic BM is with respect to the U(l) gauge 
transformations A — > A — dA. So for any two 3-d 1-form fields on a flat space, one without 
loss of generality keeps one in a fixed gauge and changes the gauge of the other until the two 
1-form fields are as close as possible. Thus overall Aj — > fiA,a,bAj = Aj — dA^ + gc[A, A]i — 
aj — b X Aj, where a^ is K copies of a, and both a^ and b are independent of position on 

I first use the global square root reparameterization-invariant implementation RI[R2] 



'yM(G) = / "^^V " / d'^^M^f ) / d'^fiA,a,bAi • fiA,a,bAl • (351) 



Defining 2N = / (E If^i axAi+g[A,A]i)2) ^ ^^^^ momenta are ttj = 27V^6AabAT. Varia- 

y /d'^a;BA,a,bArDA,a,bA ' ' 

tion with respect to the auxiliaries Ai(2;), a^ and b respectively yields 

(352) 
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the Yang-Mills Gauss constraint D -tti = , 



M 



j d^x^vr^ , (353) 



There is additionally a primary constraint from the global square root: 



(354) 



(355) 



The ELE's are vrf = D9{2N'^Ff^) = 0. These guarantee the propagation of the con- 
straints. Now, it is required for the time-label 



A to be such that 



F^gFf^ + fiA,a,bAi • 6A,a,bA' 



(356) 

in order to recover the Yang-Mills field equations. One further does not usually correct with 
respect to Eucl, but the standard form of gauge theory precisely corresponds to G-BM. 



I next consider the local square root ordering. 

'Vm(G) = I'^^I - ^M^f )fiA,a,bAi • 6A,a,bA' . (357) 



Defining 2N = w- . ^"L ^ — tt, the field momenta are now vtj = 2A^fiA,a,bAi- As above, 

V JjA,a,bAi-]JA,a,bA 

variation with respect to the auxiliary variables yields H352|) . ()353|) and H354|) . whereas there 



is now one primary constraint per space point, 'P(x) = vr|7r^ -|- Fp^F^'^ = E 

(358) 

due to the local square root. The ELE's are now vr? = (^2iVD[9^fl) . These propagate 
by standard energy-momentum conservation: V gives 

Sp = (Ejx B\ = ttIfIp = 4 {dpA\ - dqAl + C^j^A^Af) = (359) 

as a secondary, whose propagation involves no new constraints. It is true that now this is 
locally rather than globally restrictive and thus only gives a small fragment of conventional 
electromagnetism or Yang-Mills theory. Similar thoughts play a corrective role in the 
universal null cone result (see V.l). This difficulty goes away in GR coupled to these 
matter fields because of the role of the momentum constraint. Thus, GR permits more 
complete Machian matter field theories. Finally, I note that the Hamiltonian constraints 
arise in place of labelling conditions that ensure the recovery of the standard form of the 
equations of physics. There are to be no such privileged labellings in GR! 



Ill: Alternative TSA theories 



1 Strong gravity alternatives 

" ' Well, in our country' said Alice ... 'you'd generally get to somewhere else if you run 
very fast for a long time ' " Lewis Carroll [77j 

1.1 Introduction 




Carrollian limit 



CO 

Galilean limit 



Figure 9: 



If one adopts Einstein's RPl, there is then actually a third choice^^ (in addition to the 



Galilean f . 



prop 



oo and the Lorentzian V' 



prop 



finite): the Carrollian RP2 [MS] 



i^prop 



0. 

Whereas in the Galilean limit the Lorentzian light-cone becomes squashed into a plane 
of simultaneity due to the infinite speed of physical signals, in the Carrollian limit the 
Lorentzian light-cone becomes squeezed into a line as points become entirely isolated due 
to the zero speed of physical signals (see fig 9). Thus the Carrollian universe consists of 
entirely isolated worldlines. The most common interpretation of this is as ultralocal physics 
(no spatial derivatives). Klauder |218l 1219] has studied ultralocal scalar field theory as a 
non-renormalizable but nevertheless controllable quantum field theory. 

Strong gravity is the regime of GR in which KoK — K^ <C R which may be considered to 
be letting G — > oo, or alternatively letting Vprop = c — > |352j . This was first considered 
by Isham |2UUj so as to be a starting point for a nonstandard perturbative theory of quantum 
gravity along the lines of Klauder's method above. There are actually two forms of strong 
gravity 



KoK-K^ 



A 



(360) 



and 



KoK-K^ = A 



-D^K = 



(361) 

which are thus 5 d.o.f and 2 d.o.f theories of gravity respectively. I retain a 'cosmological 
constant' A to ensure the theory is nontrivial. Strictly, one is taking the G — > oo limit while 
keeping ^ constant to arrive at these theories. Strong gravity is locally Carroll invariant 
just like GR is locally Lorentz invariant. 

Note how these are written in 'split' form. Now, strong gravity has a degenerate metric 
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®^ Another source of choices might be to introduce somewhat more comphcated structures such as in 
noncommutative geometry approaches. 



This gives an unusual geometry, as studied by Henneaux |18Uj . One needs to use a weaker 
notion of inverse than usual, leading to only a very limited notion of parallel transport. The 
extrinsic curvature Kab becomes merely the ab components of a 4-tensor K^^ rather than 
being the whole of Kab, and (|361() cease to be the OA components of a 4-tensor equation 
(the EFE's). There does not appear to be a clean interpretation of strong gravity as a 
4-tensor theory - spacetime structure does not appear here as it does in GR. 

What use is strong gravity? It approximates GR near the cosmological singularity, since 
it gives an independent Kasner universe at each spatial point, which is the conjectured 
behaviour of the general solution of GR (see 1.2.7.1). Thus it is a worthwhile regime to 
quantize [2971 12981 1142j . The notion of strong gravity is related to the two most popular 
approaches to quantum gravity as follows. It is analogous to the tensionless string j3241 
EHIESS, 330, 383 , and it admits an Ashtekar variable formulation pHl I193j (see III. 1.5). 

In the TSA, we consider strong gravity as a dynamically-consistent theory of evolving 
3-geometries on its own merit, obtained as an alternative to GR. My more careful analysis 
revealed that the 3-space approach gives rather a 1-parameter family of strong gravity theo- 
ries. Their discovery provides a different answer to Wheeler's question from the uniqueness 
of BFO, in the case of the strong-coupled limit cr = 0: there is a consistent theory not only 
for the W = 1 DeWitt supermetric of the usual strong gravity, but also for any ultralocal 
invertible {W |) supermetric. Whereas the use of initially bare or initially BM velocities 
in the GR case of the TSA does not affect the final form of the emergent theory, in the 
strong gravity case the outcome is affected. I furthermore discuss how they can be related 
(as limits relevant to the very early universe) to the well-known scalar-tensor theories of 
gravity (III. 1.3), and used as toy models toward the study of conformal gravity (III. 1.4). 

Like Minisuperspace, strong gravity is a GR regime amenable to quantization. My the- 
ories provide an enlargement of this second arena, in which different permitted ranges of 
values of W permits considerable mathematical differences \156\ . In particular, for W < ^ 
one has theories with positive-definite i.p's. The study of these could broaden the un- 
derstanding of the i.p Problem of quantum gravity (see VIII). I furthermore discuss the 
possibility of the very early universe actually having a positive-definite i.p. I then show 
(III. 1.5) that the Ashtekar variable formulation, of potentially great use in quantization, is 
not readily available for W ^ 1. The strong gravities are studied as systems of equations 
in III.1.6. 

I also return to strong gravity in IV. 1.4, where I investigate the coupling to it of fun- 
damental matter. This enables fruitful comparison with matter coupling in the GR case, 
leading to better understanding of some of the GR TSA results. 



1.2 Strong gravity and the TSA: X is arbitrary 



We are interested in finding consistent theories of evolving 3-geometries; we use the TSA 

(363) 
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The canonical gravitational momenta are given by (|327j) with 2N = v/ 

The strong gravity Hamiltonian constraint follows as a primary constraint from the local 



square root form of the Lagrangian: 71^°^^ = —j= (pop p^j — \//iA = . 

(364) 

^'-variation yields the usual secondary momentum constraint H84|) . 
The ELE's are the obvious subcase of the ELE's (I32(^ 



VhNh'^A - ^ (^p'"^pj - ^p'^p^ + £i:f^, (365) 



From these one can evaluate Ti: 
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which vanishes weakly in the sense of Dirac. 

This corresponds to the a = possibility of ()33U|) . for which any ultralocal supermetric 
is allowed. The theory traditionally called strong gravity has 1^ = 1 because it is obtained 
as a truncation of GR. But I have now shown that there is a family of such theories as far 
as dynamical consistency is concerned. Because W = ^ is badly behaved, the family of 
dynamically-consistent theories naturally splits into W > ^ and W < ^ subfamilies. Such 
W < ^ theories should be simpler to quantize than more habitual W > ^ theories (including 
GR), because the former have a positive-definite supermetric, removing difficulties in defin- 
ing a suitable quantum i.p. A particularly simple example of such a dynamically-consistent 
theory is the W = theory, for which the constraints are pop = hA , Djp^^ = . 
W = may be of particular relevance, in part because conformal gravity has been formu- 
lated in terms of the W = supermetric, and in part from string-theoretic considerations 
(see III. 1.3). Conformal gravity arises because, in a conformal generalization of the above 
working, the equivalent of the slicing equation ()332() is independently guaranteed to hold. 
There is also a separate strong conformal theory Tlj (see III. 2. 6). 

Since for a = H33U|1 reduces to (|366() . the momentum constraint may no longer be seen 
to arise as an integrability condition. This fact was already noted by Henneaux jlSOj . Strong 
gravity thus provides a counterexample to the suggestion that all additional constraints need 
arise from the propagation of TC. However, all the other constraints can be interpreted as 
arising in this way in the standard approach to GR (see VI. 3-4): TCi, the electromagnetic 
Gauss constraint Q, the Yang-Mills Gauss constraint Qj and the 'locally Lorentz' constraint 
J^AB from working in some first order formalism. 

So strong gravity without a momentum constraint is also dynamically consistent |313j . 
and the new W ^ 1 strong gravities may be treated in this way too. In fact, it is this 
treatment that corresponds to strictly taking the G — > oo limit of GR (as opposed to 



Pilati's approach p96_s in which the momentum constraint is kept). This is because the GR 

momenta are proportional to j23j . One could also start off with a Tii, but use Ashtekar 
variables in place of the traditional ones. In this case, the analogues of 7ii and J'ab cease 
to be independent 

If one starts off with 'bare velocities' rather than BM ones, for cr = one obtains 



which vanishes weakly, and so no further, secondary constraints emerge, and one has the 
5-d.o.f 'metrodynamical' strong gravity theories. 

1.3 On the meaning of the theories: application to scalar-tensor theories 

The X ^ 1 departure from the DeWitt supermetric does not appear to affect Henneaux's 
study of the geometry. Whereas these theories are no longer interpretable as truncations 
of GR, they do correspond to truncations of scalar-tensor theories (such as Brans-Dicke 
theory), in a region where the scalar field is a large constant. The relations between the 
Brans-Dicke parameter uj and our coefficients W and X are shown in fig 10. I now discuss 
the possibility that a positive-definite {W < | i.e w < 0) inner product can occur in our 
universe. There is no point in considering Brans-Dicke theory, an action for which is 




{3X - 2)Np 
2Vh 



n . 



(367) 




(368) 



since this has to constant in space and time and we know from the recent analysis of Cassini 
data that today uj > 20000 |379j . corresponding to W being very shghtly larger than the 
GR value 1. However, general scalar-tensor theory permits co{x) so uj varies with space and 
in particular with time. So it could be that the very early universe had a very different value 
of CO from that around us today, since the bounds on lo from nucleosynthesis permit 
nucleosynthesis ^ 1 ^ rjij^^ bounds from 12151 are less strict but presumably applicable to a 

today ^ 

wider range of theories since the origin of the departure from W ^ 1 (i.e to ^ oo) is there 
unspecified. Furthermore, to is attracted to the GR value at late times in scalar-tensor 
theories |95| I96j . so it need not have started off large. One would expect tu of order unity in 
any fundamental scalar-tensor theory '95'. For example uj = —1 arises in low-energy string 
theory [l2j. 



G ijj^j noninveitible G '1^ noninveitible 




CO in low-energy string theory 



CO today in 
solar system 



Figure 10: Although the Brans-Dicke parameter uj > 20000 in the solar system today |52) . scalar-tensor 
theories permit u to vary and it is suggestive that ui could have been smaller in the early universe. If ui tends 
to or passes through zero, the corresponding strong gravity theory there would be considerably different from 
the strong gravity theory corresponding to GR due to the character of its supermetric. 



It is thus an open question whether uj at early times could have passed through the 
value 0. This question is interesting for the quantum-mechanical reasons given in the next 
paragraph. I first wish to clarify the role of my strong gravities in such a study. They do 
not permit W (and hence uj) to change with time, so I am not suggesting to use these to 
investigate whether such a transition through a; = is possible. But if such transitions are 
found to be possible, the very early universe could then be described by scalar-tensor theories 
which have uj < 0. Then one of my strong gravity theories which behaves qualitatively 
differently from the usual W = 1 strong gravity corresponding to GR would be relevant as an 
approximation near the initial singularity. I propose to study the possibility of having such 
a transition using the full scalar-tensor theories. Unlike their strong gravity limits discussed 
above, for which this transition involves passing through a noninvertible supermetric,^^ the 



^^In fact, this W = ^ supermetric is of the same form as the degenerate strong gravity 4-metric, so the 
pointwise geometry of Superspace for = ^ should be taken to be akin to Henneaux's geometry |180| of 



full scalar-tensor theories are not badly-behaved as uj 



0. This is because despite the 



degeneracy of the tensor ('gravity') supermetric for a; = 1, what counts for the full theory is 
the larger scalar-tensor supermetric. Just like the GR supermetric may be represented by 
a 6 X 6 matrix, the scalar-tensor supermetric may be represented by a 7 x 7 matrix where 
the new seventh index is due to the scalar field |216j : 



Now, this is well-behaved at u = because of its scalar-tensor cross-terms, due to which 
the degeneracy of the 6x6 block is not sufficient to cause the whole 7x7 scalar-tensor 
supermetric to be degenerate. But in the approximation by which the theories of this 
subsection arise from scalar-tensor theories the scalar momentum is negligible, so one is 
then left with only the 'gravity' supermetric. 

Thus in principle there could be different possible early universe behaviours which admit 
3 different sorts of strong gravity limits, corresponding to indefinite, degenerate and positive- 
definite i.p's. Each of these cases has a correspondingly different kind of natural candidate 
for the early-universe WDE i.p. 

1.4 Application to conformal gravity 

Another application of the W < ^ theories follows from conformal gravity (see III. 2) being 
T^-insensitive and thus expressible in terms of a = < ^ supermetric. I will additionally 
show that a sequence of theories can be formed: W = strong gravity, conformal strong 
gravity, conformal gravity, which could permit the isolated study of some of the novel 
features of classical and quantum conformal gravity (see V.2.1.3) 

In answer to whether arguments from the II. 1.3 are applicable to conformal gravity, 
begin by noting that there is no 'expansion term' p in conformal gravity. Because this 
absence is due to p = being separately variationally imposed (rather than due to = 
occurring for the vacuum theory) , the presence of non-minimally-coupled-scalars or the re- 
lated use of conformal transformations are unable to reintroduce a p into conformal gravity. 
Consequently, conformal gravity cannot be included among the 'wider range of theories' for 
which the less stringent bounds on W mentioned in 1.3 are applicable. Conformal gravity 
is a theory in which W plays no role at all. Presumably the classical and quantum study 
of conformal gravity on Superspace with W < ^ and W > ^ are equivalent once projected 
down to conformal Superspace. Working out how this happens may be interesting and 
instructive, at least from a theoretical point of view. 

1.5 Difficulty with implementation of Ashtekar variables 

This section includes theories for which the i.p Problem of quantum GR is altered (if not 
ameliorated). In the case of conformal gravity, its preferred foliation additionally represents 
an attempt to circumvent the Problem of time of quantum GR. One must however recall that 
these Problems of quantum gravity are always intertwined with other formidable Problems, 
which include operator ordering and regularization. At least in GR, Ashtekar variables 
have nice properties as regards these Problems (1.3.3.3). scalar-tensor theories or conformal 
gravity admit an analogue of Ashtekar variables. Indeed, how special is GR in admitting 
Ashtekar variables with their nice properties? 

In the strong-coupled hmit of GR, the constraints (|269|1 . (|27n|) and (|27H1 become [56 |ll93j 
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:Aa,E"] = , 
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strong gravity spacetimes. 
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tr(E"E^[Aa, h]) -hA = . (372) 

By the cyclic property of the trace and use of ()37U|) . that ()371() is redundant as claimed in 
III. 1.1. Furthermore, there is an equivalent form for the remaining constraints H37U|) and 

(EH innj: A[,,] = , (373) 



Kb^cdG^''" = A (374) 

(for Aab = tr{ka<Jh))-, which manifestly displays the dependence on the (now overall unden- 
sitized inverse) DeWitt supermetric Q^^'^'^. I then investigate what happens when G"'"^'^ 
is replaced by G^'^'^. Notice how then the Hamiltonian constraint no longer contains a 
truncation of the natural object F*^^ = 2d^a^b] + ll^ii-'^b]!* SU(2) Yang-Mills theory, in 
correspondence with W ^ 1 strong gravity not being a natural truncation of the GR Hamil- 
tonian constraint. The constraint algebra closes. 

For full scalar-tensor theory, I do not think Ashtekar variable with W ^ 1 will work. 
One has there the option of making conformal transformations to put scalar-tensor theory 
into aW = 1 form, but the conformal factor required then causes the constraints to be non- 
polynomial [71]. As for conformal gravity, one could as well write the theory with W = 1, 
but I see no way that conformal gravity's lapse-fixing equation (|4()9j) can be expressed 
polynomially. All these theories' lacks could be viewed as a 'second RWR result': the 
form of the supermetric in is fixed to be DeWitt's GR one if one requires passage to 

Ashtekar variables with their quantization-geared neatness. 

1.6 PDE problems for strong gravity theories 

The 5 d.o.f version of the theories has a trivial thin sandwich formulation, as a consequence of 
having no thin sandwich equation. Beware this simple state of affairs since it is reminiscent 
of Minisuperspace models. Also, it only works for the A < case by the Problem of zeros 
corresponding to the thin sandwich approach. But within this case there are globally no 
zeros, a decided improvement on the situation in GR. The condition A < has the knock-on 
effect of forcing > from the Gauss constraint. 

The IVP involves solving the constraint K oK -\ K = A . 

^ (375) 

No scalings are fixed in the 'York IVP method' corresponding to these theories, because there 
is no Codazzi constraint to keep conformally-invariant and the conformally-transformed 
Gauss equation can contain no l-DV'P terms because it has no R in the first place. This 
latter point means that the IVP now involves no differential equations at all: the 'York IVP 
method' is now an algebraic method. If one declares everything to be known except the 
scale of the metric, one can scale the various objects in the Gauss constraint as one pleases 
and explicitly solve it algebraically for the scale. The reason for the GR study's restriction 
to CMC slicings is also absent because there is no Codazzi constraint. 

The 2 d.o.f version of the theories has a nontrivial thin sandwich formulation. As above, 
the Problem of zeros becomes the mere requirement that —A > ^ > which is a global 
rather than local condition. Although many of the counterexamples hold irrespective of W 
and of the fixed sign of the potential, I have reasons to suspect the proofs will turn out to 



be TF-dependent. 



1-3W 

The IVP now involves solving the constraints H375jl and DiK ^■'H K = . 

(376) 

Now, one scaling is fixed in the 'York IVP method', because there is a Codazzi equation 
to keep conformally-invariant, but again the conformally-transformed Gauss equation can 
contain no l-DV'P terms because it has no R. So for hij and K^^^^ scaling as in (|233)) and 
H241|) . r] = —2C,. According to one interpretation there should be no p and j"" as these terms 
become negligible in the strong-coupled gravity limit (this is distinct from the 'Pilati limit' 
interpretation used in IV. 1.4 to illustrate other points). In this case the full constraints have 
less terms in them than in GR. The resulting IVP is not trivial: one starts off by solving a 
J* = momentum constraint. But then, in place of a Lichnerowicz-York p.d.e, one has 

3 

where we have demanded that A does not scale so that it remains a cosmological constant 
in the physical frame. Then for example, using the usual but now unforced r/ = 4, this has 



the trivial explicit algebraic solution '0 



12 



(378) 

The maximal case is never sustainable, since the corresponding lapse-fixing equation is 
trivially only solved by frozen dynamics (A^ = 0), independently of what asymptotics the 
spaces have: p ~ 3AA , A 7^ , A > . The CMC case is sustainable, trivially soluble 
and trivial in evolutionary character: 

d f V \ 3A — 2 
B(A) = — = 3AA + ^ — ^ N = spatial constant . (379) 



dX h 2 

Finally, note thus that the conformal thin sandwich approach to strong gravity just reduces 
to solving the Codazzi equation, now for the shift rather than the vector potential Wi. 

What of the Cauchy problem for these equations? The indefinite signature means that 
only derivatives with respect to label time are contained in the evolution equations. Thus 
the Cauchy problem is merely that for an o.d.e, and may be cast into the form hij = 
F{hij, hij;^i; N) away from the bad value W = ^. 

2 Conformal alternatives 



2.1 Introduction 



Recollect from II. 3 that there is a tension between the spacetime and split spacetime ge- 
ometrodynamical interpretations of GR. The TSA would add to this tension if it leads to 
realistic geometrodynamical theories in which the GR structure of spacetime is not emer- 
gent. III.l contains toy examples of this, appropriate for extreme regimes. This section is 
about attempts to build such theories which are realistic in everyday as well as extreme 
regimes. 

In the usual geometrodynamical interpretation of GR, the configuration space is the 
extension of Riem to Riem x B x P, where B is the space of shifts /3* and P is the space 
of lapses a (which are positive functions). However, since a and f3^ have no conjugate 
momenta, the true gravitational d.o.f 's of GR are contained in Riem. They are furthermore 
subjected to 7i and TCi. If, in the thin-sandwich problem for geometrodynamics, one could 
solve 7ii in terms of the Lagrangian variables for /?*, then the theory would be defined on 



the quotient, Superspace. However, because of H, Superspace still contains one redundant 
d.o.f per space point. Then GR is a BM RI theory whose true configuration space is 
contained within Superspace. As part of this interpretation it is natural and equivalent 
to regard A'^ and ^* (which become identified with a, as velocities associated with 
cyclic coordinates rather than as Lagrange multiplier coordinates. Also (1.2.9.4.2), one may 
take the true configuration space of CWB GR as close to being Conformal Superspace 
(CS). In this section the TSA to geometrodynamics is tentatively extended to include scale 
invariance. The resulting conformal gravity theory is a BM RI theory on CS; the role of 
the auxiliaries being velocities and not coordinates plays a role in deriving this from first 
principles. Whereas conformal gravity, has an additional linear constraint p = 0, the theory 
still has 2 d.o.f's per space point. To form a configuration space of CWB GR, one requires 
to adjoin the single global volume of the universe to CS, thus forming a space 'CS+V. 
Theories on CS+V will also be developed. 

In III. 2. 2, I somewhat rework Barbour's scale-invariant particle dynamics model, to 
be used to help test out a number of conceptual and technical issues for scale-invariant 
geometrodynamics. This model requires E - V to be homogeneous of degree - 2 in the 
particles' relative positions, which docs not disrupt usual physics by the trick of division by 
suitable powers of the moment of inertia. I begin by treating the auxiliary a responsible for 
scale invariance as a coordinate. a- variation yields a new constraint Q{i)P'^'^^ = 

analogous to p = 0. I next treat the auxiliaries as velocities and give a derivation of 
the scale-invariant particle dynamics theory from BM first principles. This involves the 
compensatory auxiliary field giving the action an overall 'banal' scale invariance, and the 
use of a free-endpoint variational principle. 

In III.2.3, I give a 2-auxiliary Lagrange multiplier coordinate (0 and 6) formulation 
of conformal gravity, ^-variation yields the p = constraint. The maintenance of this is 
guaranteed by the lapse-fixing equation (LFE) arising from ^-variation. For this LFE to be 
workable, one requires division by a suitable power of the volume of the universe, V. For 
this to be workable, one must choose between e.g CWB or asymptotically flat before the 
theory of gravity is declared. 

In III. 2. 4, we arrive at conformal gravity instead from the 3-space principles of de- 
manding a consistent RI BM action where there is now both Diff-BM and Conf-BM. The 
conformal gravity action is required to be homogeneous of degree zero in the auxiliary vari- 
able (j), which is implemented by division by the suitable power of V. Wc find that what was 
previously considered to be a second auxiliary coordinate 6 is in fact closely related to the 
velocity corresponding to (p. However, there are still 2 separate equations arising from the 
auxiliaries by the free-endpoint variational principle. In III.2.5 we include a cosmological 
constant as a precursor of the inclusion of matter in IV. 2. 

In III. 2. 6 wc work in the Hamiltonian formulation treating whichever auxiliaries arise as 
Lagrange multiplier coordinates. We provide a range of CS and CS+V theories. The idea 
is to explore the whether CWB GR in York's formulation is closely analogous to some BM 
theory in CS+V, in which given an initial point and an initial direction in CS+V, a unique 
dynamical curve is determined. In both GR and CS+V theory, there is a unique definition 
of simultaneity given by CMC slicings (corresponding to York time). CS+V schemes may 
be the more promising ones as regards both reformulating GR and establishing less radical 
but more secure departures from GR. Further theories which share some features with GR 
and conformal gravity are also presented, including the asymptotically-flat counterparts of 
conformal gravity and the CS+V theory. Material supporting III. 2. 2-6 drawn from the GR 
conformal IVP method is provided in App III. 2. A. A comparison of conformal gravity with 
other conformally-invariant theories is provided in App III.2.B. 



Whereas for conformal gravity the first principles derivation is conceptually helpful, it is not clear how 
to do this for alternative theories considered later. This is why I also provide the auxiliary coordinate 
interpretation. 



Here is a summary of which aspects of these theories are considered further on in the 
thesis. Matter is included in IV. 2: it turns out, as in RWR, that a universal null cone, elec- 
tromagnetism and Yang-Mills theory are picked out. There is a discussion in V.2: conformal 
gravity should be in agreement with the standard tests of GR, and strong differences will 
emerge in cosmology and in the quantum theory; these differences will be less pronounced 
for CS+V theories. In V.2. 2 I improve on the CS+V theory, and extend the list of conformal 
theories. In V.2. 3 I provide and an account of how conformal gravity and the CS+V theory 
work as p.d.e systems, from both the IVP~CP and thin sandwich perspectives, update the 
cosmology and provide a comparison of CS+V theories with GR. The quantum treatment 
is updated in VIII. 



2.2 Scale-invariant particle dynamics model 



Here I rerun the Machian program for an n-particle mechanics with the additional suppo- 
sition that scale should be relative. Then one should demand invariance under the 

transformations q^.^ — > A(A)q^.j+g(A) , A(A) G SO(3)x5R+ = SL(3) , the linear group 

(380) 

A /(A). (381) 

The group of the first of the above two transformations is now not the Euclidean group 
Eucl, but the similarity group Sim = {translations, rotations and dilations}, which does 
not preserve lengths but does preserve angles and hence shapes. 

The relative configuration space is then the quotient with respect to this similarity group 

Shape Space SS(n) = ^ = . (332) 

him dilations 

This is trivial for n = 1, 2 and has dimension 3n — 7 for n > 3. For n = 3, SS is Barbour's 
toy space Triangle Land [S^, the space of all triangular shapes. 

Barbour then works indirectly on SS by BM: replace bare velocities according to 

%) ^k, n, eq(,) = q(,) - k - x q^^^ - ^q^.^ . (383) 

This involves a new dilational correction to the particle velocities; for the moment I consider 
bringing in this and the usual translational and rotational corrections by use of auxiliary 
coordinates. A RI action is as ever used to implement the invariance under the second 

_ (n) 

transformation 



J dAL = J dA\/E - vVt , T= ^ m^i)^i^, n, e^^^^ySik, n, eq, 



(0 

W={i) 

(384) 



The conjugate momenta are 



p«^ 



dq 



E-V 



-m 



{i)fik, n, eq(.) • 



(385) 



(") r) . D 

= E — V . 

m(i) 

(0=(i) 



(386) 



Variation with respect to the auxiUaries then yields as multipher equations the usual 2 
BB82 conditions M = C = and furthermore the vanishing of the dilational momentum: 



in) 

W={i) 



%) • p 



. 



(387) 



This means that the moment of inertia of island universes described are thus conserved, 
which would not generally be true in Newtonian mechanics. 



The ELE's from variation with respect to q^ .^ are 
dl 



P 



9q, 



Hi) 



E-Vdq 



+ 



E- V. 



(388) 



These maintain the constraints = £ = as before, and also maintain D = provided 
that V - E is homogeneous of degree —2: 



(n) 

«=(i) 



d{y -E) _ 1 

(i)={i) ^{i) 




d 



(i)={i) 



(V-E) , (389) 



where the second equality makes use of the primary constraint (|386() . 



At first sight, this homogeneity requirement is inconvenient because Coulomb's law ()57|) 
and Newton's law of gravitation involve ^ and not ^ potentials. But there are actually 
reasonable ways of making ^ potentials give effective physics similar to that for ^ potentials. 
Barbour's favoured method to achieve this is by use of appropriate powers of the moment 
of inertia, which depends on r but seldom changes much. Barbour's proposed action built 
along these lines is ()384|) with 



V 



EE 



(390) 



To aid the development of conformal gravity, I now discuss two further possible schemes 
for the above working. First, to make proper sense of the action as a RI action, the 
auxiliaries associated with scaling, translation and rotation should all be regarded as the 
velocities associated with cyclic coordinates to which free-endpoint variation is applied. Let 
the scaling auxiliary hitherto used he = 9. I also set C = Ina, whereupon the action is 



dA 



\ 



v(q, 



(n) 

« E 

«=(1) 



(391) 



which corresponds to constructing the action in an arbitrarily-scaled frame (by passing to 



Foster and Barbour's corrected coordinate) 



Hi) 



aq 



(392) 



and writing the action as 



dA 



V(aq 



(") d(aq,.,) 



{i)=(i) 



dX 



(393) 



The action then has the fohowing 'banal invariance' under scahngs: 



I — > I under < 



(394) 



To get the same field equations as before, consider 5b and 5b as separate variations. Then 



(n) 



the primary constraint reads 



W={i) 



(395) 

variation with respect to a gives = -p^, and free-endpoint variation yields = 0, so 
V = 0. 

This second procedure may not be very general. While it works for translations, dila- 
tions and indeed conformal transformations, it is at least obscured for the non-commuting 
rotations. 

2.3 Two auxiliary coordinate formulation of conformal gravity 

I now present conformal gravity much as it was originally conceived by O Murchadha j4() | lll j . 
using two auxiliary coordinates to give and maintain the scale-invariance. Conformal gravity 
has subsequently been explored from the single auxiliary corrected-coordinate perspective 
(III. 2. 4). The style below is available further alternative theories in III. 2. 6, while it is not 
clear whether these possess a corrected coordinate formulation. 

We desire a theory on CS, i.e on Riem with restrictions due to DaP""^ = and p = 0. 
The first restriction is familiar: it means that we are within Superspace. Thus one is to 
use corrections to the metric velocities that are Lie derivative with respect to an auxiliary 

variation with respect to which encodes Dap"'^ = The second restriction is dealt with 
analogously: introduce a suitable correction to the metric velocities associated with some 
auxiliary 9, variation with respect to which encodes p = 0. On inspection, this requires the 
additional correction 



hi 



£i:h 



hi 



£f:h. 



9hi 



(396) 



9 can be interpreted as some C in conjunction with free-endpoint variation so that the action 
is truly homogeneous of degree 1. Thus one might consider an action (in analogy with the 



BSW formulation of GR) 



dA / d^xVhx/RTf. , 



(397) 



for 



Tf. — G°''"^'^Si^^0hab&^,ehcd ■ 



(398) 



The conjugate momenta are then 



h 7? 

ab _ I r^ahcd — o l 



(399) 



1 



As ever, these are related by a primary constraint --j= ypop ^ 



- VhR = . 



(400) 



and ^*-variation yields 



DaP"' = 



(401) 




Note however that p = is one constraint per space point, corresponding to the local 
conformal transformations, in contrast to the global scale transformations. Just as P ensures 
moment of inertia conservation for island universes, p = ensures the conservation of the 
volume for spatially compact universes. 

When it comes to propagating this new condition, we know that in GR this is achieved 
where possible by solving the maximal LFE. Now, this can be successfully encoded by 
bringing in another auxiliary (j) into the action as follows: use 



and (j) variation gives the maximal LFE, in the distinguished representation (the one in 
which (j) = 1). This distinguished representation is the one in which the theory being 
developed most closely resemble the ADM split of GR. In III.2.6 the ELE's, the LFE and 
the working that the constraints propagate is all checked to be independent of this choice. 

However the maximal LFE AA^ = NR is well-known to be insoluble in CWB GR (see 
C.l). This shortcoming of GR maximal slicing was bypassed by York's generalization to GR 
CMC slicing, which has a nicer CWB LFE (see C.l). We bypass it here in a different way 
envisaged by O Murchadha: consider instead of (|403|) an action in which division by some 
power of the volume of the universe occurs. This was inspired by the Sobolev quotient used 
in the proof of the Yamabe result [ (|469|) of App III.2.A], and is analogous to the division 
by the moment of inertia in the particle model. There is then the issue of what power of 
the volume should appear in the new action. The answer is |. One can arrive at this by 
arguing as in the next subsection that the action should be homogeneous of degree in 
(j) so as not to include volume among the d.o.f's (since otherwise the action would not be 
invariant under the constant scaling). 

Alternatively, one can argue that this particular power is necessary for consistency. 
Suppose the action is 




(403) 




(404) 



for some n. Then variation with respect to (j) yields the LFE, which is 



RN - D'^N = -n<NR> 



(405) 



in the distinguished representation. Then integrating this over space yields 
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fd'yynA]r)R{y)N{y) 
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d^xVhRN , 



(406) 



because of CWB, Gauss' theorem, the LFE, and that spatial integrals are spatial constants 
and can therefore be pulled outside further spatial integrals. Then the same integral incon- 
sistency argument as for the maximal LFE holds unless n = ^ (see App C). 
Thus the desired action is 




(407) 



with corresponding constraints — = — j"-^ i=P°P = > D},p°^ = , p = . 

(408) 



and LFE RN-D'^N =< NR > 

(409) 

in the distinguished representation. Whereas in the TSA to GR and are treated as 
freely specifiable gauge velocities, in this conformal gravity N is fixed, and its role as gauge 
variable is taken over by 0. Since plays the same role in both theories, conformal gravity, 
like GR, has two d.o.f's per space point. However, in contrast to GR, they are unambigu- 
ously identified as the two conformal shape d.o.f's of the 3- metric hij. Conceptually, this 
is a pleasing result, but it has a far-reaching consequence - conformal gravity cannot be 
cast into the form of a 4-d generally covariant spacetime theory. Because the lapse is fixed, 
absolute simultaneity and a preferred frame of reference are introduced. The reader may 
feel that this is too high a price to pay for a scale-invariant theory. Of course, experiment 
will have the final word. However, one of the aims of pursuing this option is to show that 
conformal invariance already has the potential to undermine the spacetime covariance of 
GR. 

2.4 Single auxiliary Lagrangian formulation of conformal gravity 

My 3-space first principles treatment of conformal gravity is to consider an arbitrary-frame 
RI action. The notion of 'frame' now includes both Diff and Conf (all A-dependent). This 
will lead to Diff-BM and Conf-BM, i.e beginning with the highly redundant configuration 
space Riem, we add to that degeneracy by forming a Diff- and Conf-BM action on 
Riem x E x P, where H is the space of Diff-BM-encoding auxiliaries and P is the space 
of suitably smooth positive Conf-BM-encoding auxiliaries (p. 

Under Conf, the metric transforms according to hij — > hij = Lo^hij , 

(410) 

where the scalar u is an arbitrary smooth positive function of the label A and of the posi- 
tion on the 3-space. The fourth power of lo is the traditional one used for computational 
convenience, for the same reasons as in L2.9.3. Then one has 

h^^ = oj-^h'^ , (411) 
Vh = u^Vh , (412) 

R^^^^U-^^y (413) 

To work in the arbitrary conformal frame, one is to use the Foster-Barbour corrected 
coordinates 

hij = (t>%3 ■ (414) 



Note that there is then invariance under the pair of compensating transformations 



Kb — >oj hab , (415) 



</> ^ - , (416) 

to 

which is the geometrodynamical generalization of the invariance under the banal transfor- 
mation (|394|) of the particle model. 

Now, parallehng the derivation of (|ITT] - IIT^ . (|^TH) leads to 

h'^ = </,-4/iii , (417) 



Vh = (j)^Vh, (418) 



R = ^-^lR-^^\, (419) 



Additionally, Q^hab = ^^{(p'^hab) = (fy" [^ihab + 4^/ia6 

which gives the Conf-BM metric velocity. 

From (|417|) and H42U|) we can make a T that is Conf-invariant: 

T^w = r''Gt%{(^^Kb)^^[cl,^Kd) t^w = T^w • (421) 
Note that we begin with an arbitrary W in the inverse supermetric 

G^""^ = Vh{h'"'h^'^ - Wh^-^h""^). However, whereas W = 1 was found in RWR |HH] to be 
crucial for the consistency of GR, we will see that this apparent freedom does not play any 
role in conformal gravity. 

One could likewise make a Conf-invariant potential density using the metric alone, from 
available conformally-invariant Bach tensor and the Cotton- York tensor of 1.2.9.4.3, but 
the suitable combinations they give rise to are cumbersome, e.g 

hbabchdefG^'^'^fy , h~\ (y'^S^'^GZc^'' , fl^"^6afecW)' (422) 



(420) 



def^ j , ri yy y <^abcd ) y "abcOdef ^ 

where T'''""^^^ = J2p(^Se K(p)tP(")P('')tP(^)p('^)tP(^)p(-^) for any tensor where K{p) are a 
priori free coefficients and Sg is the permutation group of 6 objects. 

We instead resorted to using the auxiliary variable (p, in addition to the metric, in order 



to build our potential. 




(423) 

Now in contrast to (|421j) and the situation in relativity 138 , each term in the potential 
density part of a generalized BSW action changes under ()415() . This has the consequence 
that the action of conformal gravity must, as a 'conformalization' of the BSW action, depend 
not only on the velocity (p of the auxiliary variable (p but also on itself. This parallels 
what was found for the gauge variable a of the dilation-invariant particle action. 



So our action for pure (matter-free) conformal gravity is, upon additionally dividing by 
the homogenizing power of the volume as explained below: 



dA 



V 



\ 




vl 



cw 



(424) 



where V is the 'conformalized' volume ^ ~ j ^^"^^^"^^ ' 

(425) 

This division by the volume implements homogeneity of degree zero in (p. We next explain 
this homogeneity requirement and comment on this means of ours of implementing it. 

hij, dA ) ' I = / dA / d'^xL . 

Provided we ensure that L is a functional of the corrected coordinates and velocities, it 
is bound to be invariant under the conformal transformations (|416|) . This would be the 
case if we omitted the volume V in (|424() . But by the rules of BM suggested in III. 2. 2 the 
Lagrangian must also be invariant separately under all possible variations of the auxiliary 
variable. Now in this case the auxiliary is a function of position, so one condition that L 

dl 

must satisfy is --— = if (h = spatial constant . 

ocp 

(426) 

A glance at (j424|) shows that if V were removed, (j426p could not be satisfied. The action 
must be homogeneous of degree zero in <p.^^ We exhibit this in the first expression in 
(|424|) , in which the two expressions (|418[ I419|) that derive from and in the BSW 
action have been multiplied by appropriate powers of V. (The second expression is more 
convenient for calculations.) 

A separate issue is the means of achieving homogeneity. Barbour and O Murchadha 
originally attempted to achieve homogeneity by using not R2 but R^^ since VhR^ will 
satisfy (|426|) . However, this already leads to an inconsistent theory even before one attempts 
'conformalization' (see II. 2. 2. 3). Since an ultralocal kinetic term T has no conformal weight, 
another possibility would be, as mentioned above, to construct a conformally-invariant 
action by multiplying such a T by a 3-d conformal concomitant scalar density. Much 
hard work would be required to investigate whether any such possibilities yield consistent 
theories. Even if they do, they will certainly be far more complicated than conformal 
gravity. 

We have therefore simply used powers of the volume V in the style of the Yamabe 
work. This gives us conformal gravity. The use of the volume has the added advantage 
that V is conserved. This ensures that conformal gravity shares with the particle model the 
attractive properties it acquires from conservation of the moment of inertia. We extend this 
method (of using powers of V to achieve homogeneity) in IV. 2 to include matter coupled to 
conformal gravity. It is the use of V that necessitates our CWB assumption. It is a physical 
assumption, not a mere mathematical convenience. It would not be necessary in the case 
of theories of the type considered in the previous paragraph. 

We must now find and check the consistency of the equations of conformal gravity. 
The treatment of BM in the particle model in III. 2. 2 tells us that we must calculate the 
canonical momenta of hij and (j), find the conditions that ensure vanishing of the variation of 

^*This is the single symmetry property that distinguishes conformal gravity from GR, which for this reason 
Just fails to be fully conformally invariant. 



the Lagrangian separately with respect to possible independent variations of the auxiliary 
(j) and its velocity = (p^ and then show that these conditions, which involve the canonical 
momenta, together with the ELE's for hij form a consistent set. This implements BM by 
the free-endpoint method. 

The canonical momentum pj^ of (/> is P<h = — = ——-^G'^'^\!>f{ch'^hab)—h(.d , 

(427) 



where 2A^ = — '-f^iT- The gravitational momenta are p""^ = ——-^G^'^?>^{4>^hcd) 
" — J- 2NVi 



(428) 



Thus one has the primary constraint p = —p^j, . 

(429) 

The primary constraint (|429|) is a direct consequence of the invariance of the action (|424j) 
under the banal transformations H416() . 

Independent variations 6(j) and 6V(j, of <j) and its velocity in the instantaneous Lagrangian 
that can be considered are: 

1) 54> is a spatial constant and dv^p = 0. 

2) S(f) is a general function of position and (5^^ = 0. 

3) 6(j) = and 6V(f) 7^ in an infinitesimal spatial region. 

The possibility 1) has already been used to fix the homogeneity of L. Let us next consider 
3). This tells us that = 0. But =■ Pcj,, so we see that the canonical momentum of cj) 

must vanish. Then the primary constraint (|429j) gives p = . 

(430) 

As a result of this, without loss of generality we can set W , the coefficient of the trace, 
equal to zero, TV = 0, in the generalization of the DeWitt supermetric used in ()424p . If 
conformal gravity proves to be a viable theory, this result could be significant, especially for 
the quantization programme (see II. 6), since it ensures that conformal gravity, in contrast 
to GR, has a 

2N ( P \ 

positive-definite kinetic term. Indeed, in GR, since hab = —1= \Pab — -zhabj +2I?(aC6) > 
the rate of change of \/7i (which defines the volume element \//id^x) is measured by p, 

Vh = -^ + VhDae- (431) 

Therefore, in conformal gravity the volume element - and with it the volume of 3-space - 
does not change and cannot make a contribution to the kinetic energy of the opposite sign 
to the contribution of the shape d.o.f's. Finally, we consider 2) and (including the use of 

P0 = 0) we obtain (l)^N (r - -D^{^^N) = cj)^ ((j)^N (r-^ 

(432) 



where we use the usual notion of global average: 



<A>= 



J d^xVhA 

^fdWT ■ 

(433) 



The lapse-fixing equation (LFE) (|432j) has a status different from the constraints and it has 
no analogue in the scale-invariant particle model, or in any other gauge theory of which 
we are aware. We get this LFE and not a further homogeneity requirement such as in the 
particle model, because although the action is homogeneous in we are now dealing with a 
field theory in which there are spatial derivatives of (j). It is, however, a direct consequence of 
Conf-BM, and as explained below, it plays an important role in the mathematical structure 
of conformal gravity. 

Besides the trace constraint (|43U() . p""^ must satisfy the primary constraint 



due to the local square-root form of the Lagrangian, and the secondary constraint 

Di.p''^ = (435) 

from ^*-variation. Of course, H435|) is identical to the GR momentum constraint H84|l. while 
(|434|) is very similar to the GR Hamiltonian constraint (|<S3jl . 
The ELE's for hij are 

^3 V 4> J Va 

^^^^ ac b I f'^uabucd uacubd 
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cp<^h-' (c^'N [r - ^) ^ + ^p-\<^ . (436) 

They can be used to check the consistency of the full set of equations, constraints and 
LFE of conformal gravity. To simplify these calculations and simultaneously establish the 
connection with GR, we go over to the distinguished representation in which (j) = 1 and 
^* = 0. The three constraints that must be satisfied by the gravitational canonical momenta 



are 



Vh 



p o p = 0, DbP' 



ab 



p = 
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The LFE (IM becomes D^N-NR = - <NR> , 

(438) 

and the ELE's are 

^ab ^ :^^Iihab _ j^ab^ _ ^^f^abj^2j^ _ Jjajjb^^ _ ^^^ac^b^ _ ^^ab ^NR>. 

Vs V3 Vh 31/3 

(439) 



Since the volume V is conserved, and its numerical value depends on a nominal length- 
scale, for the purpose of comparison with the equations of GR we can set V = 1. (This 
cannot, of course, be done before the variation that leads to the above equations, since 
the variation of V generates forces. It is important not to confuse quantities on-shell and 
off-shell.) We see, setting V = 1, that the similarity with GR in York's CMC slicing is 
strong. In fact, the constraints (|4U8() are identical to the GR constraints and the CMC 
slicing condition at maximal expansion, and the ELE's differ only by the absence of the 
GR term proportional to p and by the presence of the final force term. Finally, there is the 



LFE (|4ci8|) . which is an eigenvalue equation of essentially the same kind as the LFE (|267|) 
required to maintain York's CMC slicing condition ()262|) . 

In our view, one of the most interesting results of this work is the derivation of such 
LFE's directly from a fundamental symmetry requirement rather than as an equation which 
could be interpreted as maintaining a gauge fixing. We now develop this point. 

This is part of the confirmation that we do have a consistent set of equations, constraints 
and LFE. We show this in III. 2. 2. 6, which amounts to demonstrating that if the constraints 
(|inS|) hold initially, then they will propagate due to the ELE's and the LFE The 
propagation of the vector momentum constraint is always unproblematic, being guaranteed 
by the 3-diffeomorphism invariance of the theory. The two constraints that could give 
difficulty are the quadratic and linear scalar constraints. In RWR |HH]) the propagation 
of the quadratic constraint proved to be a delicate matter and generated the new results 
of that paper. However, in this section we are merely 'conformalizing' the results of j38j . 
and we shall see that the consistency achieved for the quadratic constraint in jSH] carries 
forward to conformal gravity. The only issue is therefore whether the new constraint p = 
is propagated. Now, we find that the form of p evaluated from the ELE's is automatically 
guaranteed to be zero by virtue of the LFE. Thus the propagation of p = is guaranteed 
rather than separately imposed. It is in this sense that conformal gravity is not maintaining 
a gauge fixing. 

The same conformal gravity theory can also be reached using the 'bare principles' of 
II. 2. 2. 4, provided that one takes the Dip = branch of possibilities to propagate the 
Hamiltonian constraint. Notice how one gets two conditions to encode, and the encoding 
of the second of these (the LFE) does not look like any encodements we have seen before. 
We will also see how conformal gravity can be formulated in a lapse uneliminated fashion 
in VI. 

Finally, we briefly mention two sources of variety. First, we find that there is analogously 
a Euclidean conformal gravity and a strong conformal gravity (which unlike in III.l has 
without loss of generality W = 0), with action 

|Strong ^ / _ ^ (^440) 

which may be of use in understanding quantum conformal gravity. Note that the power 
of the volume V needed to make the action homogeneous of degree zero is here one, since 
now it has to balance only and not the product \fh\fR. This theory is simpler than 
conformal gravity in two ways: A is less intricate than i2, and the LFE for strong conformal 
gravity is AA^ =< AA^ > so since < AA^ > is a spatial constant, A^ is a spatial constant. 

Second, instead of constructing an action with a local square root, one could use instead 
a global square root and thus obtain 



I = / dA^ , , (441) 

J Vs 

which gives rise to a single global quadratic constraint. The above alternatives are cumula- 
tive: for both conformal gravity above and each theory below, we could consider 6 variants 
by picking Euclidean, strong, or Lorentzian signature and a local or global square root. The 
Lorentzian, local choices expanded throughout are the most obviously physical choice. 

2.5 Integral conditions and the cosmological constant 



The coupling of matter originally left BFO at a loss through Foster's demonstration of 
the appearance of integral inconsistencies in the actions they were considering. The idea I 



used to resolve this problem comes from the analogous attempt to combine the actions of 
strong conformal gravity and conformal gravity in order to consider conformal gravity with 
a cosmological constant, A. We give the combined action obtained by applying homogeneity 
of degree zero in (p: 



dA / d'^x 




J 



dA ^- 5 — = / dA^ , (442) 

where we have also included a signature a to show how ()442|) reduces to the strong conformal 
gravity action H44U|) in the limit a — > 0. That this is a correct way to proceed is emphasized 
below by the lack in this case of an integral inconsistency. 

The conjugate momenta p^^ and are given by (|427j) and (|428j> as before, but now with 
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and the primary constraint (|429|) holds. Again, the end-point part of the (j) variation yields 
P0 = 0, so p = 0, so without loss of generality W = 0, but now the rest of the ^-variation 
gives a new LFE, 

o,.. 3A^A J <A^A> 

1^ I 
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in the distinguished representation. For this choice of the action, there is indeed no integral 
inconsistency: 



= 2a VkdSaD^N = 2a J d^xVhD'^N 
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The ^'-variation yields the usual momentum constraint ()84() . and the local square root 
gives the primary Hamiltonian-type constraint 



^{<jR+ , 



2 



A 

in the distinguished representation. The ELE's are 
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fi,p^^- = ^{h^^R - R^n + ^{DWm - h^W'N) - ^ 
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where we have split the working up into pure conformal gravity and A parts (the J here 
can also be split into the pure conformal gravity integrand and a A part). The A part of 
the working for the constraint propagation presents no extra difficulties. 

Note that in conformal gravity the cosmological constant A (just like its particle model 
analogue, the Newtonian energy E) contributes to a conformally-induced cosmological- 
constant type force. The penultimate term in (|447() is the final term of pure conformal 



gravity in (|439j) . and the final term is induced by A. We will see in IV. 2 that matter also 
gives analogous contributions. The significance of this is discussed in V.2. 



2.6 Hamiltonian formulation and alternative theories 

In this section, we examine the Hamiltonian formulation. We exercise two options in con- 
structing theories such as GR and conformal gravity, so that this section is more general 
than IV. 2. 5. First, while we have hitherto only considered the CWB case, we now also touch 
on the asymptotically-flat case. Second, we also attempt to incorporate both maximal and 
CMC conditions. 

Our methods of doing so necessitate a discussion of whether the corresponding LFE's 
are fundamental or a gauge- fixing. When we conformally-correct only the hij, I think 
that one can interpret the slicing condition as pertaining to a particular slice and then 
either adopt the LFE which propagates it, or not, as a continued gauge choice. When we 
conformally-correct both hij and hij, we find that the corresponding LFE is variationally 
guaranteed. This is the case in conformal gravity. I think such fully-conformally-corrected 
theories are the more complete ones. Below I build up the Hamiltonian structure toward 
this completeness. 

We begin with the case that corresponds to working on Superspace. Recollect how ADM 
and Dirac showed that the Hamiltonian for GR can be written as 

H = y d^x{Nn + C'H^) (448) 

where and ^* are regarded as arbitrary auxiliary Lagrange multiplier coordinates. Now 
we wish to parallel this work on the space CS-I-V by working with CMC slices. There is 
then little difference between the compact and asymptotically- flat cases. Our treatment 
simply involves adding another constraint to the Dirac-ADM Hamiltonian: 

H'' = y d^x[Nn + Cn, + {D'^y])p\ (449) 

and treating i] as another Lagrange multiplier. The Laplacian is introduced here to obtain 
the CMC condition as the new constraint arising from r/- variation: D^p = ^ = C(A). 
In addition we get the standard GR Hamiltonian and momentum constraints from A^- and 
^*-variation. Now, we can choose to impose ^ ('^) ~ which gives the CMC LFE. 

Whereas one could reinterpret this as the study in the CMC gauge of the subset of (pieces 
of) GR solutions which are CMC foliable, we can also consider this to be a new theory with 
a preferred fundamental CMC slicing. The latter interpretation is our first CS-I-V theory. 

This CS-I-V theory's evolution equations are (|188j) and (|189|) but picking up the extra 
terms hahD^t] and —p°'^D^rj respectively. We have already dealt with the CMC constraint; 
it turns out that we need to set D^r] = to preserve the other constraints. Therefore the 
CMC Hamiltonian is well-defined when one uses the distinguished representation D^rj = 
and that A^ satisfies the CMC LFE. 

It is more satisfactory however to introduce a second auxiliary variable to conformally- 
correct the objects associated with the metric. We used (1 -|- D'^Q)'^ in place of cf) since this 
implements the volume-preserving conformal transformations VPConf: 

V -V = j d^xVh (1 + D'^C)^ ^ - j d^xVh = j d^xVhD^C = j dSaD^C = (450) 



®^By a preferred fundamental slicing, we mean a single stack of Riemannian 3-spaces. This is not to be 
confused with GR, where there are an infinity of stacks between any two given spacelike hypersurfaces (the 
foliations), each of which is unprivileged. 



in the case where the 3-space is CWB. Applying these corrections to the Hamiltonian 



ansatz H'^^ = j d?x[NrL^+C^^+{D'^V)p] , 

(451) 



Vh 



(452) 



in order for C-variation to encode the correct LFE NR — D^N H = B(X). 

^ 4:h ^ ' 

(in the distinguished representation D^(^ = D^rj = 0) to maintain the = C(A) given by 

r;- variation, we require that a = — | and j3 = ^. This corresponds to and 4>^ scahngs, 
i.e a relative scaling by (/)^^, between the trace and tracefree terms. Thus, (|45'2() happens 
to be the Lichnerowicz-York equation in (j), subject to its solutions only being admitted if 
they are in VPConf, cj) = {1 + D'^Qe . At this stage, we do not know why we are obtaining 
the same scalings as in York's work. 

We next try to extend the above workings to the case of the maximal condition. This 

involves starting with a slightly different Hamiltonian, = J d^x(N7i+S,^7ii—9p) . 

(453) 

0-variation gives us the maximal condition p = 0. In the asymptotically-flat case, it makes 
sense to impose p = to arrive at the maximal LFE D^N = RN , 

(454) 

since this is an extremely well-behaved equation when taken in conjunction with the bound- 
ary condition — > 1 at infinity. Again, A^- and .^'-variation yield the standard GR Hamil- 
tonian and momentum constraints respectively. Now if we choose 9 = in the evolution we 
preserve the Hamiltonian and momentum constraints. Therefore can be interpreted as 
yielding standard GR with a choice of maximal gauge, or alternatively that this represents 
a new theory with a preferred fundamental maximal slicing: our first asymptotically-flat 
theory. 

This does not work in the CWB case, since then now the only solution of (|454|) is A^ = 0: 
frozen dynamics. But if we use the volume of the universe (which amounts to moving away 
from GR), we are led to new theories. First, consider 



H'^v = [ d^xiNn"^ + en^ - Op) , = ^p o p - ^r . (455) 

J Vh V3 

Then the A^, 6 variations yield the constraints = 0, = and p = 0. Imposing 
p = yields a LFE D^N = RN- < RN > . 

(456) 

Whereas this could be regarded as a gauge fixing, the underlying theory is no longer GR. 
The other interpretation is that of a new theory with a preferred fundamental maximal 
slicing - a poor man's version of conformal gravity. 

Second, consider the use of two auxiliary conformal variables: 

H^'^^ = / dHNn^ + en. - Op) , ^ ^p op-^I^R- ^) . (457) 



Then the A^-, and 0- variations yield the constraints TiP = 0, = and p = 0. 
Hamilton's equations are now the evolution equations 

2NVI 

Si^hab = ^ Pah — Ohab , 
Q pab = ^ f ^h-b _ j^ab\ _ ^(/,a6^2^ _ ^a^fe^) ^ ^h^^p o p 

-^^^pVc - -\ {NR) h"-^ + Op"^ (458) 
in the distinguished representation cj) = 1. Whilst variation yields (|456() . Hamilton's 

equations give p = '^^{NR- < NR > -D^N) 

V3 

(459) 

which is thus automatically zero due to (^SHI)- This theory is (full) conformal gravity, as 
can be confirmed by Legendre transformation and BSW elimination to recover the 
Lagrangian of HI. 2. 5. Because the LFE that maintains the condition is also guaranteed, 
conformal gravity is definitely not interpretable as a gauge fixing. 

Before turning to these, we mention that there is a further (full rather than poor man's) 
maximal theory which arises from considering two conformal auxiliary variables in the 
asymptotically-flat case: 

H^o = [ d^xiNH''' + eUi - Op) , n't' = —^p o p - <P^Vh (r - . (460) 

Then iV, 6 variation yield the constraints = 0, = and p = 0. Now, cf) variation 
gives the maximal LFE (|454j) . and so automatically guarantees the propagation of the 
condition p = Q. Note that T-f^ = is the original Lichnerowicz equation (|264() . Again, 
the Lichnerowicz equation carries the auxiliary variable which encode the LFE. In contrast, 
in conformal gravity, one must modify the corresponding Lichnerowicz equation by the 
introduction of volume terms to get an analogous scheme. 

Like conformal gravity, our full asymptotically-flat theory has no role for p in its dynam- 
ics, but, unlike conformal gravity, it does not possess global terms. We are less interested 
in this theory than in conformal gravity because it would not be immediately applicable to 
cosmology, on account of being asymptotically flat. These two theories should be contrasted 
with our full CS+V theory, in which p does play a role, which means that the standard 
GR explanation of cosmology is available. We believe CS+V theory merits a full treatment 
elsewhere (started in V.2.2) as another potential rival to GR. 

We now consider the preservation of the other conformal gravity constraints Ti'^ and 
7ii. We find that we need to set = 0. Therefore (by comparison with GR) the only term 

2 . 

we need to worry about is the j < NR > /i" 

3^3 

(461) 

term in the p"^ equation. Since it is of the form Ch"''' it clearly will not disturb the momen- 
tum constraint. Therefore we need only worry about conserving the Hamiltonian constraint. 



This is quite straightforward. We first realize that 



(462) 



Hence 



V = 



(463) 



The only other term to worry about is 
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Yl 
7^ 



pop . 



(464) 



Varying gives 



3Vh 



< NR> p = . 



(465) 



Therefore the constraints are preserved under evolution. 



We now show that we can just as easily treat the Hamiltonian dynamics of conformal 
gravity in the general representation. The LFE from cp variation is now 



whilst Hamilton's evolution equations are now 



7D' 



8D^ 



2W3 

%^ab = Pab - Ohab 



(466) 
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We can compare these expressions to their Lagrangian analogues (|428|1 . (|436|) and we see 
they coincide if = ^g^- This will guarantee that the constraints are preserved by the 
evolution. Alternatively, we could evolve the constraints using the Hamiltonian evolution 
equations (|467() and discover that the constraints propagate if and only if the lapse function, 
A'", satisfies (|466|) . the shift, ^, is arbitrary, and 6 satisfies 6 = ^Q^4>- The variation gives 
the LFE (|432j) . We emphasize that 6^0 is arbitrary in the full theories, unlike in the poor 
man's versions, where one ends up having to set the auxiliary {9 or D'^t]) to zero. 

It is not obvious then that p = is guaranteed from Hamilton's equations, since what 
one immediately obtains is, weakly, 



p ^ 5 — I R 



vl 



vl 



V3 



2Vh 


V3 



R 



8D^ 



(468) 



We now require use of D'^^cp^N) = (j)D^{(j)^N) + 2h'"^dc{4>^N)dd(t> + (f^ND'^ct) to see that 
H432p indeed guarantees p = 0. 



App III. 2. A: Supporting material on conformal IVP method 



Links between our alternative theories and York's work 

With their preferred maximal and CMC slicings, our theories are written naturally in the 
conformal IVP language, and are best compared to GR via the York formulation of GR. 
York's work thus contains many useful concepts and tricks (see 1.2.9.3, 1.2.10, which were 
largely given as an early section in ABFO). Rather than being exactly the same, I will 
show that some of our tricks share a common origin with York's in V.2. The notion of CS 
and CS -\- V themselves follow from York's work in 1.2.9.4.2. York's work also offers as-yet 
unexplored options to some things, such as the style of treatment of matter in 1.2.9.6 and 
the possible alternative use of actions based on the 3-d metric concomitants of 1.2.9.4.3. 
I furthermore study the closeness of both CS+V theory and conformal gravity to GR in 
both the traditional IVP and the thin sandwich formulation in V.2.3. I next outline the 
additional piece of conformal mathematics that inspired the division by the volume that led 
to conformal gravity. 



The Yamabe conjecture 



Yamabe's |386j manifestly-global conjecture is that all n-metrics are conformally equivalent 
to metrics of constant scalar curvature. The proof for for i? < was relatively simple, but 
to prove the R > case Schoen j325j required a new global idea: the Sobolev quotient 



= (469) 

(presented here in 3-d). Note that the ELE corresponding to interpreting this as an action 
is the = 0, m = 1 vacuum Lichnerowicz-York equation. We are not concerned with 
what Schoen did with this quotient to prove the theorem, but rather with the idea of the 
quotient itself. 

The Yamabe conjecture is well-known in the GR IVP since the choice of constant cur- 
vature unphysical metrics simplifies the Lichnerowicz-York equation jl951 186j . 



Some Open Issues 



In GR, the CMC condition is a gauge fixing without fundamental physical significance. The 
solutions of conformal gravity will strongly resemble solutions of GR in the CMC foliation 
at maximum expansion. Some CS-I-V theories will resemble CMC-sliced GR even more 
closely. However, there are interpretational differences between those conformal theories 
that have a privileged slicing and reslicing-invariant GR. First, there is a restriction in the 
solution space of GR since not all GR spacetimes are CMC sliceable, nor is a CMC slicing 
necessarily extendible to cover the maximal analytic extension of a given spacetime |336j . 
Second, effects regarded as gauge artifacts in GR, such as the 'collapse of the lapse' N — > 
in gravitational collapse (1.2.10) can have corresponding physical effects in privileged-CMC 
slicing theories. 

Can CS-I-V theory be formulated in terms of one auxiliary alone, like conformal grav- 
ity? Does our CS-I-V shed new light supporting York's contention that the two conformal 
shape d.o.f's of CS are, together with V , a representation of the true dynamical d.o.f's in 
CWB GR? Finally why does the Lichnerowicz-York equation arise from demanding the 
conformalized GR action to encode the LFE? Some of these issues are further pursued in 
V.2. 



App III.2.B Other scale-invariant theories 



There are two important differences between tfie manner in whicli conformal covariance is 
acliieved in conformal gravity and the two best known earher attempts to create conformally 
covariant theories: Weyl's 1917 theory |368j and Dirac's simphfied modification of it jll3j . 
First, both of these earher theories are spacetime theories, and their conformal covariance 
leaves 4-d general covariance intact. In conformal gravity and in York's representation of 
GR, there is an element of the absolute. Second, the conformal covariance is achieved in the 
theories of Weyl and Dirac through a compensating field that is conformally transformed 
with the gravitational field. In Weyl's theory, the compensating field is a 4-vector field that 
Weyl identified as the electromagnetic field until Einstein jl25j pointed out that atomic 
spectra would then be path-dependent, in contradiction with astronomical observations. 
Weyl later reinterpreted the idea of a compensating field in his effective creation of gauge 
theory [A72\ . but he never salvaged his original theory. In Dirac's simplification, the com- 
pensating field is the additional scalar field in Brans-Dicke theory ,68_ . This possibility has 
been exploited more recently in theories with a dilatonic field ^367, 68 . In contrast, con- 
formal gravity has no physical compensating field; the variable (f) ()414() is a purely auxiliary 
gauge variable used to implement Conf-BM. This is therefore a more radical approach, in 
which full scale and conformal invariance of the gravitational field by itself is achieved. In 
checking the literature on Weyl's theory, we came across Einstein's 1921 paper |126j . in 
which he attempts to implement an idea similar to ours, albeit the implementation itself is 
very different. Einstein follows Weyl in employing only ratios of the 4-metric components, 
but drops the idea of a compensating field. Thus he is also aiming for a gravitational field 
that is scale invariant by itself. The implementation Einstein proposed was very tentative, 
and we are not aware of anyone attempting to further develop it. 



IV TSA: coupling of bosonic matter 



We now seek to convert the TSA to a route to relativity with matter 'added on'. In 
IV. 1.1 we consider the analysis of BFO concerning scalars and the apparent selection of 
electromagnetism among the possible single 1-form theories. In IV. 1.2 we consider the 
theorem that underlaid my calculation with Barbour ^U] about the apparent selection of 
Yang-Mills theory among the possible many interacting 1-form theories. Consider this as 
a derivation of the standard material provided for comparison in 1.1.7. 

A tidier version of all the calculations for minimally coupled scalars and many interacting 
1-forms is presented in IV. 1.3 as following from the theorem. The strong gravity counterpart 
of these calculations [3] is presented in IV. 1.4, and compared with the GR results. 

The theorem is modified in IV. 2.1 for use in conformal gravity, and a further theorem is 
built guaranteeing there is no integral inconsistency arising from the LFE that propagates 
the new p = constraint, for the broad class of homogeneous actions considered. In IV. 2. 2 
we show all the matter fields above can be coupled to conformal gravity. It is interesting that 
locally Lorentz-invariant physics drops out of a non-generally covariant theory. I include two 
technical appendices IV. A on naive renormalizability and IV. B on the Gell-Mann-Glashow 
theorem (which is used in both the TSA and the standard derivations of Yang-Mills theory) . 
App IV. C provides for comparison Teitelboim's derivation of Yang-Mills theory by the HKT 
route. 

For future reference of related material, the existence of suitable bosonic matter fields 
in the TSA is much strengthened in VI by an indirect method. This however does not have 
the constructive status of the method of this chapter. Following the criticism in VI and 
suitable reshaping of the whole program, I succeed in including also spin-^ fermions, in fact 
I include a full set of matter fields that suffice to describe nature as we know it. There 
are more matter issues in VII. Overall, these later works show that the TSA is certainly 
not a unification and that the matter fields are picked out a great deal less uniquely than 
originally thought, although the TSA still does appear to be selective. 

1 Matter in TSA formulation of GR 
1.1 Original BFO work 
1.1.1 Scalar fields 

Barbour, Foster and O Murchadha included a scalar field ? by considering the action 

l^g^ = y dA y x^/hVRT(F Vt^Tt^ (470) 
with the gravitationally BM scalar kinetic term^'' T"^ = (B^?)^ and the potential ansatz 

Defining 2A^ = ^J^^^^, the conjugate momenta are given by (|.S18|) and 

Then the local square root gives as a primary constraint a Hamiltonian-type constraint 

= Vh{R + U^) - (^p o p - lp2 ^ vr^^ = . (472) 

^'^NB the U's and T's in this chapter differ by constants from the standard spacetime forms. This has no 
effect on resuHs. I leave them as they are to reflect the emergent character of the standard physics from the 
TSA point of view. 



^'-variation gives as a secondary constraint the momentum constraint 



'Hi = -2Djp>i + 7rai<j = . (473) 

The ELE's are 

^ = iL = VhN{h'^R - R'^) - ^{p'^pj - Ip'^p) + Vh{+DW'N - h'W^N) 
OA ohij y/h 2 

+ ^^^ (OVfflc^ - aa?5\/i*^ ) + A(„)<j(")/i*J' + £^p'^ , (474) 

(n) 

^ = Y = ^D\Nd,,) + VhNY^nA^n)^--' + . (475) 

(n) 

The constraint ""Ti contains the canonical propagation speed C of the scalar field. This 
can be read off the coupled form of (|471|) and (|475|) : 

^x{wi^X- "^^^) } = ^^A(iVaV) + 2^/^iV^ + 2£,. . (476) 

In the TSA, a priori, C ^ 1, which means there is no reason for the scalar field to obey the 
same null cone as gravity. However, propagating ''Ti gives 

= -£ii^ + ^ + A( '«) + ^O-(A'VaV) . (477) 

The theory has just one scalar d.o.f, so if the cofactor of (1 — C) in the last term were zero, 
the scalar dynamics would be trivial.''^ Thus C = 1: scalar fields must obey the null cone 
dictated by gravity. The counterpart in the standard approach is that C = 1 due to RPl. 
The point is that BFO deduce this as enforced by gravitation. 

Notice also that this scheme gives minimal coupling of the scalar field to gravity. How- 
ever, there is one other possibility because the most general ultralocal kinetic term includes 
also a metric-scalar cross-term. This gives Brans-Dicke theory. This was considered in the 
revised version of RWR, but I am not satisfied with that treatment since it is based on a 
conformal transformation (which does not preserve geodesies). I present a satisfactory and 
in fact more illuminating treatment in VI. 1.4. 

1.1.2 A single 1-form field 

To include a single 1-form field Aa, BFO considered the action 

= y dA y d^xVhVR + U^Vjs -1- (478) 

for T^ = h^'^a^AaQ^Ah the quadratic gravitationally best-matched kinetic term of Aa, and 
the potential ansatz = CiDbAaD^A''+C2DbAaD''A^+C3D''AaD''Ah+J2(k) B(k){AaA'')^ . 
The first part of this can be expressed more conveniently for some purposes by using a gen- 
erahzed supermetric C'""^ = Cih^^'h'"^ + Cik'^'^h!'^ + C^h'^^h^'^. 

Defining 2N = \[^^^k-i the conjugate momenta are given by (|318|1 and 



vr^ - ^ = ^A^' ■ (479) 



^^This argument requires further elaboration, which is provided in V.l. 



Then, the local square root gives as a primary constraint a Hamiltonian-type constraint 

^*-variation gives as a secondary constraint the momentum constraint 

^Hi = -2Djp>i + Ti\D^A, - DiA^) - D^Tr^Ai = . (481) 
Then, propagating gives^^ 

n = + + n 

+^ {(4Ci + l)Db{N\''D'Aa) + (4C72 - l)DbiN\''DaA'') + AC3Da{N\^ DbA')} 

-l-D,{N^Da7T-A'') - 1 [N^Da (pij - ^hij) DdA,{2A'C^^''^ - A'^C'^'"')] . (482) 

Now, the system has a priori 5 d.o.f's per space point, that is 2 geometric d.o.f's and the 
3 d.o.f's of the 1-form field itself. The constraints cannot include N, so the penultimate 
line includes a 3-vector of constraints multiplied by d"'N, which would take away all the 
1-form d.o.f's, thus rendering a trivial theory, unless the cofactor of d"'N vanishes strongly. 
This gives a nontrivial theory only for Ci = —C2 = — |, C3 = and if there is a secondary 
constraint 

g = DaT^" = . (483) 

The conditions on the Cs mean that the 1-form field obeys the null cone dictated by 
gravity, and furthermore that the derivative terms in Ua are —\\d x Ap. We identify the 
new constraint (|483|) as the Gauss constraint of electromagnetism (|62l I205j) . 

d { d-n" 

In order to propagate the new constraint Q, first note that "aT(-^a'^'*) ~ 



because vr" is a (1,0) density. Thus, 

^{D^7T') = 2VhDa i^N J2 kB(^kM'' Aa)"-^ A^^ + £dDi7r') (484) 

where we have used d-d x A = and mixed partial equality to eliminate a further term 

^D.D'iNiD.A^-D'A,)] . (485) 

Again, one can argue that constraints cannot depend on A'^, and then that the only way 
of avoiding triviality of the 1-form field due to the terms in daN is to have all the -B(fc) be 
zero. In particular, = means that the 1-form field must be massless. 

Now, the allowed form U"^ = -iDbAbiD^A" - D^A^) [ci (|T77jl ]. is invariant under the 

gauge transformation Aa — > Aa+da^ , 

(486) 

so we are dealing with a gauge theory. Note first how the gauge theory and the fixing of 
the light-cone to be equal to the gravity-cone arise together in the same part of the above 
calculation. These are two aspects of the same consistency condition arising from the role 

^■^I do not provide the cumbersome ELE's since I am to build a method in the next section which does 
not require their explicit use. 



of the momentum constraint in the propagation of the Hamiltonian constraint. Second, 
because we have a gauge symmetry, if we introduce an auxihary variable ^ into T"^ such 
that variation with respect to it encodes Q, then we should do so according to U(1)-BM. 

This uniquely fixes the form of T^(A,^>) to be = {Aa- £^Aa-da^){A'' - £^A'' -d^^) 

(487) 

[c.f (|177|) ]. Thus, if one identifies ^ as this derivation forces A = [^OjA] to obey 
Maxwell's equations minimally-coupled to gravity. Moreover, as pointed out by Giulini 
|158l and reported in the massive (Proca) 1-form field does not fit into this TSA 
formulation despite being a perfectly good generally covariant theory. BFO originally took 
this to be evidence that the TSA does not yield all generally covariant theories. This is 
further explored in later sections and chapters. My main aim in this chapter is to show that 
the TSA does at least permit standard fundamental bosonic field theories. 

Finally, we mention that on attempting to couple Aa to scalar fields by the inclusion 
of interaction terms, BFO showed similarly that demanding the propagation of ^'""Ti, and 
of any secondary constraints arising from it, leads to U(l) gauge theory minimally-coupled 
to GR 39_. We have thus a chain of successively more sophisticated theories, each arising 
from its predecessor by iteration of constraint propagation consistency. This provides a 
different means of deriving classical physics: Dirac's work, applied to BM RI actions, leads 
to a striking alternative to Einstein and Minkowski's 4-d foundation of physics. I will next 
build the above outline of Relativity without Relativity into an almost algorithmic formality, 
from which Yang-Mills gauge theory will emerge from allowing a general collection of 1-form 
fields to interact with each other. 

1.2 Matter workings from the perspective of a general theorem 

We next find that Yang-Mills theory minimally coupled to GR emerges as one of a few 
possibilities allowed for quite a general ansatz for the 1-form fields' potential. More precisely, 
the 1-form fields are again found to respect the gravitational null cone, to be fundamentally 
massless, and to have a Yang-Mills type mutual interaction. The calculation if done in the 
above style is much messier than that for a single 1-form field. We present it rather by use 
of a theorem that uses the general ELE's and permits a clear term-by-term computation 
of the propagation of the Hamiltonian constraint. This theorem-based approach is good in 
being more systematic than the original RWR-type treatment. 

We conclude that, within the bosonic sector, the TSA yields the key features of the 
observed world. Gravity, the universal null cone, and gauge theory all arise in essentially the 
same manner through the single mechanism of consistent Dirac-type constraint propagation 
applied to the interplay of BM with the local square root. Some field-theoretic issues are 
addressed in the conclusion. 

I consider and U* to consist of contributions from each of the matter fields ^'a 
present. I then obtain the following formulae for the propagation of the Hamiltonian con- 
straint. 

Theorem 1 

i) For nonderivative coupled matter fields ^'a with T* homogeneously quadratic in ^'a and 
U* containing at most first-order derivatives, 

where in this chapter and the next [ ] denotes the extent of applicability of the functional 
derivative within. Gab is an invertible ultralocal kinetic metric and H''^ is the momentum 
conjugate to ^a- 



ii) If, additionally, the potential contains covariant derivatives, then there is an extra 



contribution to i): A {iV^ (p, - f (|^.« - } . (489) 

The proof offered here includes both GR (cr = 1, 1^ = 1) and strong gravity (cr = 
0, A / 0, 1^ 7^ I but otherwise arbitrary). Result i) in the GR case is related to a 
result of Teitelboim j348l ^351^ that the contributions of nonderivatively-coupled fields to 
the Hamiltonian and momentum constraints independently satisfy the Dirac Algebra (see 
App IV. C). In the working below, this is reflected by our ability to split the working into 
pure gravity and matter parts. 

Use of formulae i), ii) permits the calculations to be done without explicitly com- 
puting each case's ELE's. This is because our derivation uses once and for all the general 
ELE's. 

Proof The working can be split into its ^*-free and ^* parts (the latter is not exhibited), 
i) For a homogeneous quadratic kinetic term T* = fi^^A%^BG'''^^(/i'-') , 



(490) 



the coniugate momenta are 11''^ = — ■. — = G^^^e'^R . 

The ,^*-variation gives as a secondary constraint the momentum constraint 



(491) 



^ 2D,p,i - n^^^^^ = (492) 
and the local square root gives as a primary constraint a Hamiltonian-type constraint 

-^n = VhiaR + A + u*) - -^{p o p - + GABn^n^) = o. (493) 

Then -*7^« V)I(ai? + A + U*) - (^p o p - + G^bTI^U^^ 

+ VhiaR + U*) - jp'J (^p^j - ^phij^ +p'^p^^ {hikhji - ^hjhk^ | 

- -^(2ri^GABn^ + GABH^n^), (494) 

using the chain-rule on ()493() . Now use the chain-rule on U , the ELE's p^^ = and 
= to obtain the first step below: 



'^n = ^-^NpiaR + A + U*) + ^^iVp (pop-^p' + GabH^H^ 

[d^B 5(1)6 4'b) dhab J 
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+ 



dhij 
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(495) 

In the second step above, I regroup the terms into pure gravity terms and matter terms, ex- 
pand the matter variational derivatives and use the definitions of the momenta to ehminate 
the velocities in the first three matter terms. I now observe that the first and sixth matter 
terms cancel, as do the third and fourth. In the third step I discard a term proportional 
to ^Tl, use the momentum constraint H492|) . and the pure gravity working (N.B we assume 
that either = 1 or o" = 0, so the pure gravity working does work!), and the definitions of 
the momenta to cancel the fifth and eight terms of step 2. Factorization of step 3 gives the 
result. 

ii) Now —^Ti. has 2 additional contributions in step 2 due to the presence of the con- 
nections: 

2 



(9U* / X 

Vh^:^r\, - 2 pij - -ph 



be 



be 



dT'^bc Shi 



N 



(496) 



which, using (|168() . ()346l) . integration by parts on the second term of H496|) and factorization 
yields ii). □ 

1.3 Examples 

1.3.1 Minimally-coupled scalar fields 

Application of formula i) to the minimally-coupled scalar field action ()47Up immediately 



yields that 



i-c 

N 



D\x\,db^) 



(497) 



1.3.2 K Interacting 1-form Fields 

We consider a BSW-type action containing the a priori unrestricted 1-form fields 

|Ai^ = j d\ j d^xVh\-{hij,h}\A],A^^,X,C) = j "^^j d^xVhVn + U^i x/t^ + T^i . 

(498) 



We use the most general homogeneous quadratic BM kinetic term , and a general ansatz 
for the potential term U"^'. We constructed these using both the inverse 3-metric /i"^ and 
the antisymmetric tensor density e'^'"^. 

We note that no kinetic cross-term habAic is possible within this ansatz. This is because 
the only way to contract 3 spatial indices is to use e""^^, and hat is symmetric. Then T^' is 

unambiguously T^i = Pijh'''^{Al-£^Al){Ai-£^Aj) 

(499) 

for Pij without loss of generality a symmetric constant matrix. We will assume that Pij is 
positive-definite so that the local flat-space limit quantum theory of A\ has a well-behaved 
inner product. In this case, we can take Pij = 6ij by rescaling the 1-form fields. 

We consider the most general U^i up to first derivatives of Aia, and up to four spatial 
index contractions. This is equivalent to the necessary naive power-counting requirement 
for the renormalizability of any emergent 4-d quantum field theory for Aja (see App IV. A). 
Then U"^! has the form 

U^i = OiKC^'^^DhAlD^Af + B^jKC'^'^'^DhAiaAiAf + Ij^LmC^'^'AiAfA^Af 



+^e^'^{ZjKDbAlAf + EuKAlAiAf) + Fik'^'D^Al + MjKh^'AlAf (500) 

where C, C and C are general ultralocal supermetrics, each with distinct coefficients. Oik, 
-B^jK, -^^JKLM, -^IK) -^'IJK, -^i and Mik are constant arbitrary arrays. W.l.o.g, Oik and 
MiK are symmetric, and Eijj^ is totally antisymmetric. 

Defining 2N = \f^^^^ the conjugate momenta are given by H318|) and 

dA} 2A^ 



The local square root gives as a primary constraint the Hamiltonian constraint 

^ \/^(i? + Uaj) - ^ (pop-^p^ + njnl^ = . 



(502) 



^*-variation gives as a secondary constraint the momentum constraint 

Ai^. = -2Djp^i + TT^'{D,Au - D,Ai,) - D.^IA] = . (503) 
By the theorem, the propagation of the Hamiltonian constraint is 

m = + + m 



N 



+ACsO^'^DaiN\fDhA'i^)] - ^DhiN^DaTTfA^') 



N 

+^C'''''''B^jKD,{N\jaAiAf) + ^e^''^Z-iKD,{N\lAf) 
+lF'D\N\ii) - ^O'^^Da [N\pi, - ^h,,)DdAKb{2A\C'^^''' - AfC'^'^)] 
-l^B^jKDa {N\pij - ^h,,)AiAfi2A\C-^''' - AfC'^'"')} 

-^F'Da [n\p,, - ^h,){2A\h''^ - Afh'n} . (504) 



We demand that "^'^'H vanishes weakly. Supposing that this does not automatically vanish, 
then we would require new constraints. However, we have at most 2 + 3K d.o.f's, so if we 
had 3K or more new constraints, the 1-form field theory would be trivial. Furthermore, all 
constraints must be independent of A^. Thus, terms in d°'N must be of the form {d°'NVia)S^ 
for the theory to be nontrivial [and we cannot have more than 3K independent (spatial) 
scalar constraint factors in total]. Most of these scalars will vanish strongly, which 
means they will fix coefficients in the potential ansatz. Finally, (|5fl4jl is such that all the 
non-automatically vanishing terms in are partnered by terms in d"'N . So the above big 
restriction on the terms in d°'N affects all the terms. 

1) The first, second, third, sixth and seventh extra terms have no nontrivial scalar 
factors, thus forcing O^^ = S^'^ , Ci = -C2 = -\, = 0, Zjk = and Fj = . 

2) This automatically implies that the eighth and tenth terms also vanish. The condi- 
tions on the C's correspond to the 1-form fields obeying the null cone dictated by gravity. 
The only nontrivial possibilities for the vanishing of the ninth term are C3 = and either 
i?i(jK) = or C*! = —C2 = — |, say. In fact, these are equivalent, by the following lemma. 

Lemma Given that C's = 0, -Bi(jk) = 44^ Ci = — 6*2. 

(505) 

Proof: Since C's = 0, the potential term in question has a factor of 

Af (Ci, C2) = BujiAlAf{Cih''^h^'^ + C2h'"^h'"') . (506) 

(<^) The last factor in Af'{Ci, —Ci) is manifestly antisymmetric in c and d, but Bk^jj^^A^A^ 
is symmetric in c and d. So one can take -Bi(jk) = in this potential term. 
(^) Af'{Ci,C2) = —Af^{C2,Ci) by the antisymmetry in Sjjk and exchanging dummy 
internal and spatial indices in (f^O^ . Hence BuicA^Af^Ci + C2){h'"'h}"^ + h^'^h^") = 0. 
Then the only nontrivial possibility is Ci -|- 6*2 = 0. □ 

There is in fact another possibility for the vanishing of the ninth term, 

but we will now dismiss it. Because the last factor is symmetric in ij, /C"^^ is symmetric 
in JK, and so uses up "^^ d.o.f's, so the theory is trivial for K > 5. Otherwise, we 

require this new constraint to propagate 

—Q^ = — Y 71 * Ai ) [p-' - -h-' 

+AiAf {p^^ - ^p'^-'hai^ - ^A'^A^^p-^ - . (508) 

But p*-' alone contains d^N terms, so there must be at least 3 more constraints, and we 
discover yet 3 more below, so this theory is trivial. 

3) So we are finally left with K new scalar constraint factors from the fourth and fifth 
terms, 

Q3 = DaTTj - gBuKTrlA"^" « . (509) 

Next, we examine the evolution of this internal-index vector of new constraints. We do 
now require the A} ELE, but we can simplify it by use of the restrictions imposed by ~ 
and with some regrouping of terms by dummy index exchanges, to obtain^^ 

-j^ = Y^ = -2^0''^D'{Ci{ND,A\^) + C2D'{ND^AKb)} 
OA OAji 



^■^The corresponding part of the calculation uses 



+Vh{NDbAlAMc{C''''"Bi^^ + C'^^'^c^jJM) _ DbiNA^Af)&^'''^B^MK} 

+Se'^^E^^^NAKcAf^b + 2VhNM^^Ai^ + £^7r^\ (510) 
Then, using the fact that vrj is a (1, 0)-density, the propagation of Qj gives 

Gi = £^gi - '^gTrfTT^'BuK + ^gD'A'^'iDhAi, - DiAib)B^jj^ 
+VhDi{NA'''^A'^Af) {di(/jKLM + /kjml + /lmjk + /mlkj) 

+<52(-fjKML + -^KJLM + -^MLJK + -^LMKj) + 2(73 (/(jl)km + -^KMpL)) — ^S^^^JK-BimlI 

— -^g^NA^^A^^DiAf^{B^j-ti_Bii,M + ^^jm-Bikl + -B^jl-Bimk) 
-B'^jPgA^^A'^'^AfAf {di(/QKLM + ^KLMQ + ^LKQM + ^LKMq) 
+(52(^KQLM + ^KLQM + -^QKML + -^KQMl) + 2d3(/KM(QL) + -^(QL)Km)| 

+3e""^ {Ej^KD,{NAfA^) + gEQ^j^N Af A^ Af B"^ jp} 

+2Vh{MjKDi{NA'^')-gMciKB'^jpNA'^'Af} . (511) 
In obtaining this resuh, we have used the following cancellations in addition to those which 



occur for the single 1-form case: DiDb{NAi^A'ii)B^^^ = 



(512) 



Di{N{D'Au - DbA\)Al}B^'-''^^ = . (513) 

These follow from the last two indices of B^jk being antisymmetric. 

We next demand that (|511j) vanishes weakly. Again, we will first consider the d'^N terms. 
For the theory to be nontrivial, the third, sixth and seventh non-automatically vanishing 
terms force us to have, without loss of generality, /jklm = -B^jk-Bilm, = —Ci = 

= 0, -EjNK = and Mjk = 0. This last condition means that, within this ansatz, 
fundamental interacting 1-form fields are massless. We are then left with the first, second, 
fourth and fifth terms. The fourth term forces upon us 

-B^JK-BlLM + ^^JM-BlKL + ^^JL-BlMK = , (514) 

which is the Jacobi identity (|116|) . so the -B^jk's are axiomatically the structure constants 
of some Lie algebra, A . Thus (|5()9|1 can be identified with the Yang-Mills-Gauss constraint 
(|125I2U9|) . Furthermore, the vanishing of the first term forces us to have i?ijK = -B[i|j|k]) 
which means that the -Bijk's are totally antisymmetric. The remaining terms are then 
automatically zero: the second term vanishes trivially because of the new antisymmetry, 
and the fifth term vanishes by the following argument. Writing for the symmetric 
matrix A^A^^, and rearranging the 1)514^ . 

R rI rP cKF qMG _ / d rI rI R ^ rP qKF c-MG 



By means of a KF <-> MG dummy index change, 3 uses of antisymmetry and a I ■f-^- P 
dummy index change, the second term on the right-hand side is equal to its negative and 

hence is zero: ^^jp^ikm^^fgS^^S^'^ = . 

(516) 

Hence we have the symmetry ^uk^Vp^^fg^^^^^*^ = -Bij(k|^V)p^^fg5^^5^'^ 

but then the dummy index change KF <^ MG and antisymmetry in the third structure 
constant along with symmetry between the two S'''^^'s means that the right-hand side of 
the above is zero as required. 

So the potential term must be [c.f (|178() ] 

C/aj = -^iDbAl - DaAl + gB^jKAiAf)iD'Af - D^A^ + gSiLM^"^"^^') ■ (517) 

We now investigate the meaning of totally antisymmetric structure constants -Bijk • This 
involves expanding on the account in 1.1.7.3 of the standard approach to Yang-Mills theory 
in flat spacetime, where one starts with Lorentz and parity invariance, which restricts the 
Lagrangian to be L^i = —QabF^ o F^. Furthermore, one demands invariance 6L = 

under the gauge transformation A\ — > A\+ig^f^ jj^A"^ A^ . 

(518) 

Using the standard result under a set of infinitesimal transformations parameterized by 



6L = yields Qab/^cd-^'^ ° F^ — 0) which is equivalent to Q(A|b/^c|d) = 

(519) 

by symmetry in the internal indices of o F^. 

For Qab positive-definite, there is the following theorem |15n[ I364j 

Gell— Mann Glashow Theorem : (|519j) and the following two statements are equivalent. 

3 basis in which /abc = /[abc] (520) 

The corresponding algebra is a direct sum of compact simple and U(l) Lie subalgebras. 

(521) 

The Lie algebra terminology used and the proof of the theorem are summarized in App 
IV. B. Also, ()519() 4^ ()52U() 44> ~ in the usual flat spacetime canonical working. 

Although we get Qj to do the usual work, we arrive at the above from a different angle. 
We started with 3-d 1-form fields on 3-geometries, obtained TC as an identity and demanded 
that W ~ 0, which has forced us to have the secondary constraints Qj. But once we have 
the Gj, we can use 44> ()52U|) 44> ()519() . so our scheme allows the usual restriction 

(|52H) on the type of Lie algebra. We can moreover take (|519|) to be equivalent to the gauge 

invariance of Uaj under — > A^^+ig^f^j-^^A'^A^ 

(522) 

Thus, if we introduce K auxiliary variables such that variation with respect to them 
encodes Qj^ then we should do so according to G-BM. In our emergent notation, this 
uniquely fixes the form of J^^{Aia, ^j) to be [c.f (fT78|) ] 

jM = h-'i(Al-£^Al-da<^>^ + gB^jj^Ai^^){Au-£^AM-dd^i + gBii^MA^^'^) . (523) 



Finally, if we identify with ^4^^, we arrive at Yang-Mills theory for A = [A^,A^], 
with coupling constant g and gauge group G (corresponding to the structure constants 
B^jk)- So this work constitutes a derivation, from 3-d principles alone, of Yang-Mills 
theory minimally-coupled to GR. 

One may argue also that the auxiliary variables should require to be Diff-BM. An in- 
vestigation of this however shows that this is weakly equivalent to not applying BM to the 
auxiliary variables. A number of interesting points arise from this line of thinking. 

First, consider the toy example of particles for which all velocities involved are Eucl-BM. 
From , Q the translational and rotational actions on each vector velocity are of the form 



T],v = v-is , (524) 
R^v = v-Q X V . (525) 

So the BM for each particle could in fact take the form q^^^ — > q^^-^— a— (b— a) x q^^^ . 

(526) 



Then free-endpoint variations with respect to a and b yield M + C = and £ = respec- 
tively, so things keep on working out as before. 

What is and is not BM has a strong physical impact and is put in by hand. Each field 
has associated charges that permit it to feel some forces but not others. 

In the current situation, the electromagnetic auxiliary H picks up a Diff-BM correction: 

Aa^Aa-£i:Aa-da{^-£^^). (527) 

Unlike above, there is an extra complication: this inclusion involves the previously cyclic co- 
ordinate itself appearing. This is unproblematic: the EL equations, definition of momentum 
and the free endpoint equation ensure this works out. Finally ^*-variation (or free-endpoint 

s*-variation for = ^*) yields the weak equivalent ^TCi—QdiE = 

(528) 

of the momentum constraint. Yang-Mills theory works along the same lines. 
1.3.3 Discussion 

This work shows that the TSA can accommodate many examples of physical theories. We 
can immediately write down a gravity-coupled formalism with the SU(3) gauge group of 
the strong force, or with larger groups such as SU(5) or 0(10) of grand unified theories. 
However, the work does not restrict attention to a single simple gauge group, since it also 
holds for the direct sum ()521|) . As provided in App IV. B as a corollary to the Gell-Mann- 
Glashow theorem, we can then rescale the structure constants of each U(l) or compact 
simple subalgebra separately, which is equivalent to each subalgebra having a distinct cou- 
pling constant |364j . The simplest example of this is to have -B^jk = 0, which corresponds 
to K non-interacting copies of electromagnetism. Other examples include the gauge bosons 
of (unbroken) SU(2) x U(l) electroweak theory and of the SU(3) x SU(2) x U(l) Standard 
Model. 

We emphasize that our formalism cannot predict how many of these gauge fields there 
are in nature, nor what their gauge groups are. This is to be expected once one accepts 
that the TSA is 'adding on matter' rather than a unification (see also VI and VII). 

BFO showed that a scalar field, a 1-form field, and a 1-form field coupled to scalar fields 
all obey the same null cone as gravity. In this section we have shown that this is also true 



for K interacting 1-form fields, thus providing more evidence for a universal null cone for 
the bosonic fields, derived entirely from 3-d principles. Investigation of the fermionic sector 
would tell us whether this null cone is indeed universal for all the known fields of nature. We 
also note that our formalism reveals that the universality of the null cone and gauge theory 
have a common origin resulting from the universal application of Diff-BM in conjunction 
with the need to propagate the quadratic Hamiltonian constraint. 

In this TSA formulation, fundamental 1-form fields are not allowed to have mass. The 
only bosonic fields allowed to have mass are scalar fields. This would make spontaneous 
symmetry breaking a necessity if we are to describe the real world, since the weak W~^, 
W~ and Z bosons are massive. I then speculated that it might be that (Higgs) scalars 
alone are allowed to have mass in the TSA. However, I showed that strong gravity easily 
accommodates massive 1-forms (IV. 1.4) and the spin-^ fermion mass term causes no trouble 
(VI. 4). Finally I figured out how to include massive 1-forms in another particular TSA 
formulation (VII). 

We finally consider whether classical topological terms can be accommodated in the 
TSA. Although it is not free of controversy |287|I294] . t'Hooft's standard explanation of the 
low energy QCD spectrum makes use of an extra topological term 



in the classical Lagrangian |188j . to avoid the U(l) Problem. This consists of a peak observed 
to be unsplit but theoretically expected to be split due to a U(l) symmetry. The topological 
term resolves the U(l) Problem by breaking the U(l) symmetry. Then the parameter Q is 
constrained to be small (|0| < 10~^) by the non-observation of the neutron dipole moment 
[2] . The inclusion of the topological term corresponds to dropping the parity-invariance of 
the Lagrangian. There is then a new strong CP Problem: if this symmetry is broken, why is 
it broken so weakly? The topological term is a total derivative j3()4j . Nevertheless it makes 
a contribution to the action when the QCD vacuum is nontrivial. 

We argue also that we need not yet confront the accommodation of topological terms, 
because so far we are only describing a classical, unbroken, fermion-free world. But the need 
for the new term arises from QM considerations when massive quarks are present j287j . On 
the long run, it is not clear to me whether spatial compactness and other topological features 
associated with curved spaces can affect the global notions used in particle physics. 

1.4 Coupling matter to strong gravity 

I now attempt to couple matter to the strong gravity TSA theories, following the procedure 
of BFO. This enables comparison with the GR case, and leads to a better understanding of 
how the TSA works. In particular 1) strong gravity theories impose an ultralocal structure 
rather than a Lorentz one, and they cause the breakdown of gauge theory, which reinforces 
BFO's notion that the null cone and gauge theory have a common origin in GR. 2) This helps 
clarify the central role of the differential Gauss laws of electromagnetism and Yang-Mills 
theory in the masslessness of 1-form fields in the TSA. 

1.4.1 Scalar fields 

I include first a single scalar field by considering the action 



Sstrong 



(529) 



327r2 



(strong),? 

'bsw 



/ 



dA / d^xV^VA + U^-^Tfv + T^ 



(530) 



with T'" and U'" as before. 



The conjugate momenta are given by the usual expressions (|318p and H471|) . where now 



2N = y -^pl)?-- The local square root gives the primary Hamiltonian constraint 



(531) 



^*-variation gives the secondary momentum constraint ()473() . 

The constraint ''Ti contains the canonical propagation speed VC of the scalar field. A 
priori, this is unrestricted. However, the propagation of the Hamiltonian constraint gives 

The theory has just one scalar d.o.f, so if the cofactor of C in the last term were zero, the 
scalar dynamics would be trivial. Thus I have derived that C = 0: the scalar field theory 
cannot have any spatial derivatives. So, strong gravity necessarily induces the Carroll group 
structure on scalar fields present, thereby forcing them to obey ultralocal field theory. This 
is analogous to how GR imposes the null cone structure on scalar fields present in |38j . 

I finally note that these results (and those in the next subsection) are unaffected by 
whether one chooses to use the gravitationally 'bare' instead of the gravitationally BM 
formulation. 

1.4.2 K interacting 1-form fields 

I consider a BSW-type action containing the a priori unrestricted 1-form fields I = 1 
to K, 



I (strong), Ai 

'bsw 



y dA y d?xy/h\.{hij,h}^ ,A\,A^^,N,e) = j j d^^^VA+U^i^T|, + T^i 

(533) 

where T^i and U"^! are the ansatze used before. 

___ /jg 4.JA1 

The conjugate momenta are given by ()318|) and ()5U1() where now 2N = y ^yA; • The 

local square root gives the primary Hamiltonian constraint, 



^^n = ^(^pop- ^ ^i^j j _ ^^(^ ^ u^i) = . (534) 



+ 7r,^7ri^ -^/^(A + U^i) = . 



I get the secondary momentum constraint (|5()3j) by ^*-variation. 

From the strong subcase of the theorem, the evolution of the Hamiltonian constraint is 
then 

. {3X-2)Npn 

rt = ;= h Xf /T 

2Vh * 

-IqIK ^CiD,{N\lD''A^a) + C2D,{N^nlDaA\^) + CMN^t^ID^A"^)^ 
-^C-''''B^jKD,{N\jaAiAf) + ^e-''ZjKDt{N\lAf) + ^f'd\n\j,) 
+^0''^Da I (^p^^ _ ^/.^^.^ D^Aj^, (2A\C-^'^ - AfC'^'^) | 



+^B^jKDa jiV^ [p., - ^h,,^ AiAf (2A{C''^''' - AiC'^''') | 



+^F'Da In' - ] {2A\h'^^ - Afh^n \ . (535) 



I demand that "^iTt^ vanishes weakly. The first two terms vanish weakly by the Hamiltonian 
constraint, leaving us with nine extra terms. Because there are less than 3K 1-form d.o.f's 
to use up, nontriviality dictates that most of these extra terms can only vanish strongly, that 
is by fixing coefficients in the potential ansatz. Furthermore, I notice that all contributions 
to H535() are terms in ^"A'^ or are partnered by such terms. Since further constraints are 
independent of A^, these terms in d°'N are of the form {d°'NVja)S'^ , and nontriviality 
dictates that it must be the (spatial) scalar factors S"^ that vanish. I proceed in three steps. 
1') The first, second, third, fifth and sixth non-weakly-vanishing terms have no nontrivial 
scalar factors, so we are forced to have = 6^^, Ci = C2 = C3 = 0, Dik = and Fi = 0. 
The conditions on the C"s correspond to the 1-forms obeying the local Carroll structure. 
2') This automatically implies that the seventh, eighth and ninth terms also vanish. 
3') The only nontrivial possibility for the vanishing of the fourth term is if i?ijK = 0, in 
which case the constraint algebra has been closed. 

It is enlightening to contrast these (primed) steps with their (unprimed) counterparts 
from the GR case of IV. 1.2. 

1) is the same as 1') except that Ci = —C2 = —1/4, which corresponds to the 1-form fields 
obeying the local Lorentz light-cone structure 

2) is the same as 2') except that instead of the automatic vanishing of the eighth term, 
one is forced to take .Bijk = B^jiq , which is the start of the imposition of an algebraic 
structure on the hitherto unknown arrays. 

3) One is now left with K new nontrivial scalar constraints, which happen to form the 
Yang-Mills Gauss constraint 1)5091) . So the algebra is not yet closed, and the GR working 
is then substantially longer. 

1.4.3 Discussion of strong gravity matter-coupling results 

The above results help clarify some aspects of the TSA results for GR. First, notice also 
how now that a family of supermetrics is allowed the matter dynamics is insensitive to a 
possible change of supermetric, which is encouraging for the coupling of conformal gravity 
to matter fields. 

Second, we can take further the view that local causal structure and gauge theory are 
manifestations of the same thing. In the GR case, the universal light-cone and gauge theory 
come together from the R term in ^^Ti, whilst the absence of this in strong gravity ensures 
that the collapse of the null cone to the Carrollian line is accompanied by the breakdown 
of gauge theory: there is neither gauge symmetry nor a Gauss law. In the GR case, the 
quantum-mechanics-inspired positive-definiteness assumed of the 1-form kinetic matrix Pjj 
then turns out to be necessary in the restriction of the choice of gauge group, so there would 
be a price to pay if one insisted instead on entirely classical assumptions. In the strong 
gravity case, the absence of emergent gauge structure means that there is no such price to 
pay for using classical assumptions alone. Provided that Pu is invertible, the outcome of 
steps 1') to 3') is unaltered. 

Note through what happens above in the absence of the Gauss law that it is specifically 
this characteristic of the 1-form theory that kills off its mass terms in the TSA of BFO, 
rather than some underlying principle for general matter. This is a useful first insight into 
the status of mass in the 3-space approach to GR. It is also easy to demonstrate that the 
general derivative-free potential term built out of 1-forms persists coupled to strong gravity. 

I emphasize that our result concerning the breakdown of gauge theory is in particular a 
result about GR, although it clearly occurs for all our theories and the theories they approx- 
imate. In the strong-coupled gravity limit such as in the vicinity of the initial singularity, in 
this formalism dynamical consistency dictates that gauge theory breaks down in GR. Gauge 
interactions become impossible as one approaches such a regime. This appears not to be in 
accord with the view that gauge interactions persist in extreme regimes to form part of a 
unified theory with gravity, such as in string theory. However, little is known about physics 



in such regimes, so this classical GR intuition might not hold. If string theory can tame 
such singularities, the circumstances under which gauge theory breaks down according to 
GR might not occur. However, it could even be that string theory breaks down in such 
a regime, since according to one interpretation, stringy matter could be a phase of some 
larger theory which breaks down in a high-energy phase transition j2U8j . Also, Carrollian 
regimes might arise in string theory under other circumstances, and exhibit different be- 
haviour from the strong-coupled limit of GR coupled to gauge theory, as suggested by the 
recent Born-Infeld study jl54j . 

2 Coupling of matter to conformal gravity 

It is an important test of the theoretical framework of conformal gravity to see whether 
it is capable of accommodating enough classical field theories to be a viable description of 
nature. Below we begin to show this is the case, in parallel with the previous section. The 
universal null cone result is also obtained. 



2.1 General theorems 

We require a modified version of theorem 1 for use in conformal gravity, and also a new 
theorem relating homogeneity in (p to the propagation of the new constraint of conformal 
gravity, p = Q. The forms of these below suffice for the construction of a useful range of 
classical field theories coupled to conformal gravity: we will demonstrate that the range 
of theories covered by these theorems includes much of known classical bosonic physics 
coupled to conformal gravity. Furthermore, these theories are picked out from more general 
possibilities by exhaustive implementation of Dirac's demand for dynamical consistency. 

Let ^'a be a set of matter fields that we wish to couple to conformal gravity, with 
potential term U* and kinetic term T*. We first decompose these as polynomials in the 
inverse metric. This is because it is the power of the metric that determines the powers of 
V that must be used to achieve the necessary homogeneity. Let ^^"^ be the set of fields 
such that these polynomials are of no higher degree than n. Thus 

(n) (n) 

T*'"'= E TSa.,....u.^^"^-^""= E (536) 

(fc)=(0) (fe)=(0) 
(n) (n) 

U*'"' = E ^..n...^.r''-^''' = E U(.) • (537) 

(fc)=(0) (fc)=(0) 

Then the following theorem guarantees that p = is preserved by the dynamical evolution. 
Theorem CI. For matter fields \I'("), the conformal gravity plus matter action of the form 



/ 2k / -p 2k 



J yl 

(538) 

varied with free end points is guaranteed to have p = V n G A/q. 

Note how the powers of V match the powers of the inverse metric that are needed to 
make 3-diffeomorphism scalars from the matter fields of different possible ranks. 



Proof Vacuum conformal gravity works, hence the theorem is true for n = 0. 
Induction hypothesis: suppose the theorem is true for some n = q. 



Then, for n = q + I, <f) variation gives 



(55^'?+^) (5S(«) 2(5-1) + 1 2 

w ^ ^ + - ^^^^^ '^-^^ - ^^(-^)^" 



(539) 



Now, from p^^+i) = p^i+^'^'^hij + p^^+^^'^ /iij and the metric ELE for p(i+^)'i , 



4iV 6hij 5hij 



Hence, by jSMl) P^"^'^ = |^ (^tI"^'' + ^'^ + 

by the induction hypothesis and using that U(g+i), T(q_|_i) are homogeneous of degree q + l 
in /i*-'. Hence, if the theorem is true for n = g, it is also true for n = q + \. But it is true 
for n = 0, so it is true by induction V n G A/q. □ 

As in IV. 1.2, we will now consider T* and as being made up of contributions from 
each of the fields present. We will label these fields, and the indices they carry, by capital 
Greek indexing sets. We then obtain the following formulae for the propagation of the 
conformal gravity Hamiltonian constraint. 

Theorem C2 

i) For nonderivative coupled matter fields ^'a with T* homogeneously quadratic in and 
U* containing at most first-order derivatives. 



iV '1 V 9(^6*b) 



(541) 



ii) If, additionally, the potential contains covariant derivatives, then there is an extra con- 
tribution to i): 

The proof offered here includes both conformal gravity (cr = 1) and strong conformal 
gravity (a = 0, A 7^ 0). Again, use of formulae i), ii) permits the "^Ti.^ calculations to be done 
without explicitly computing each case's ELE's. 

Proof i) For a homogeneous quadratic kinetic term T* = j^Q^^i! bG^^ > 

(543) 

the conjugate momenta are H''^ = — ■. — = j-G^^Bc^'b ■ 

d^A 2NV3 

(544) 



The ^*-variation gives the momentum constraint 
and the local square root gives a primary Hamiltonian-type constraint, 



(545) 
(546) 



in the distinguished representation. Then 



2 

1/3 



^ o p + WVi) - -^(2ri^GABn^ + GAen^n^), (547) 



using the chain-rule on 1)5461) and using h = V = {). Now use the chain-rule on U , the 
ELE's p^^ = and 11^ = and p = to obtain the first step below: 



v'i yl l^^A 5(a^'a) 



5R 



6h. 



-N 



2p 



-iV 



-2GABn^ 

Vh(7 



N- 



2 



-R - 2ap'- 



6l/jA 

5R 

6hii 



1 .idJ"^ ANVn 



2N^ dhij 



-J^ PikP 'P j 



N 



2NvIb 



Pikp'h^j 



-2p 



dhi 



1 



N - —p''^^ — ^'a* 



dhab y \fh 



J* 



(9U 

-2GABn^iv-— + 2GABn^i?i ( N 



2N' dhab 



^3 dGjKQ 



Pah 



N 



n 



A^(°^g^A) 



+ f ^n^GAB^ +2GABn^z?, ( N, 



5(A^a) VV^ 



d{Db^A), 
(548) 

In the second step above, we regroup the terms into pure gravity terms and matter terms, 
expand the matter variational derivatives and use the definitions of the momenta to elimi- 
nate the velocities in the first three matter terms. We now observe that the first and sixth 
matter terms cancel, as do the third and fourth. In the third step we use the pure gravity 
working and the momentum constraint ()545|) , and the definitions of the momenta to cancel 
the fifth and eight terms of step 2. Factorization of step 3 gives the result. 

ii) Now — *7^*-' has 2 additional contributions in step 2 due to the presence of the 
connections: 



be 



t\c - 2p'- 



be 



dV^hr 6h. 



N 



be '''ij 



(549) 



which, using ()168() and conformal gravity's slight modification of ()346p . 



be 



2\/h 



{A(iVpc") + D.iNpb'') - DyNpbc)} 



(550) 



integration by parts on the second term of (|549|1 and factorization yields ii). □ 

Although Theorem CI does not consider potentials containing Christoffel symbols, in all 
the cases that we consider below (which suffice for the investigation of the classical bosonic 
theories of nature) the propagation of "^"H^ rules out all theories with such potentials. Thus 
it is not an issue whether such theories permit p = to be propagated. 



2.2 Examples 

In this section we take a = 1 for Lorentzian (as opposed to Euclidean or strong) conformal 
gravity. We will also use W = from the outset, and A = 0, so that we are investigating 
whether our theory of pure conformal gravity is capable of accommodating conventional 
classical matter theories and establishing the physical consequences. We find that it does, 
and that the known classical bosonic theories are picked out. 



2.2.1 Scalar fields 



The natural action to consider according to our prescription for including a scalar field is 



dA / d-'x 



V 



vi 



/ 



8D'^(I) 



dA ^ , = / dA^ , (551) 

where, as in III. 2. 5, we give two different expressions to exhibit the homogeneity and to use 
in calculations. U^^^ is an arbitrary function of ? alone whilst U^-j^-^ = — 

The conjugate momenta and tt^ are given by (|428j) and (|427|) but with 



2N 



R 



-3- 

y3 



(552) 



and additionally we have the momentum conjugate to <j, vr = T^i'^ ■ 

(553) 

As in the case of pure conformal gravity, we have the primary constraint H429|) . and the 
end-point part of the (/)- variation gives = 0, so that p = by the primary constraint. But 
by construction (theorem CI) this action has the correct form to propagate the constraint 
p = provided that the LFE 



2{NR - D^N) + + 2A^U^^. = — / d?xVhN\}' + - , (554) 



3A^U^nx I [■ ^ T 

holds (in the distinguished representation), but this is guaranteed from the rest of the 
0- variation. 

The ^*-variation gives the secondary momentum constraint (|473() whilst the local square 
root gives rise to a primary Hamiltonian-type constraint, which is 

^ + + -^{pop + 7T^) = (555) 




in the distinguished representation. 



Then, using formula i), the propagation of the Hamiltonian constraint is, weakly, 



^T^c ^ (^_i)^^(^rVa^) . (556) 

Now, if the cofactor of (C — 1) were zero, there would be a secondary constraint which would 
render the scalar field theory trivial by using up its d.o.f. Hence C = 1 is fixed, which is 
the universal light-cone condition applied to the scalar field. This means that the null cone 
of gravitation is enforced even though the gravitational theory in question is not generally 
covariant in the spacetime sense. 



2.2.2 1-Form fields 



According to our prescription, the natural action to include electromagnetism is 



dX / d'^x 



V 



\ 



\ 



A V{<p)^ 



= / dA ^- ^^-^ — = / dA— , (557) 

J y(0)3 J V3 

for U"^ and T^ as in IV. 1.2. We will first show that electromagnetism exists as a theory 
coupled to conformal gravity. We will then discuss how it is uniquely picked out (much 
as it is picked out in RWR [SH])) and how Yang-Mills theory is uniquely picked out upon 
consideration of K interacting 1-form fields (much as it is picked out in jlUj). 

Again, the conjugate momenta and p^^ are given by (|427|) and l|428j) but now with 



2A^ 



' C ^3 



8D2^\ , U^V(<l>)Ti 



(558) 



a R 



and additionally we have the momentum conjugate to Ai, (|479|) . By the same argument as 
in III. 2. 5, p = arises and is preserved by a lapse-fixing equation, which is now 



2(NR - D^-N) + (a^U* - ^) Vf + J, |d3.v/ft (iVU* + ^) = f 



(559) 



The ^*-variation gives the secondary momentum constraint 14811 whilst the local square root 
gives a primary Hamiltonian- type constraint, 



AnjC 



Vh 



Yl ( 



1 

P°P^ 2"'^i'^ 

1/3 







(560) 



in the distinguished representation. 

Then, using formula i), the propagation of the Hamiltonian constraint is, weakly, 



(561) 



Suppose the cofactor of 1 — AC^ is zero. Then we require D^i,A^ = 0. But this is three 
conditions on Ai, so the vector theory would be rendered trivial. Thus, exhaustively, the 
only way to obtain a consistent theory is to have the universal null cone condition = j 



and the new constraint 



g = DaTT" = , 



(562) 



which we identify as the electromagnetic Gauss constraint 1)621 12U5|1 . The propagation of Q 



is no further bother because the Ai ELE i3^7r^ = 2VhC^Db{D^A'-D'A^) 

(563) 

is free of V and hence identical to that in the RWR case. Since the RWR argument for the 
propagation of Q follows from H563|) , this guarantees that the result also holds in conformal 
gravity. 

Since the potential is U(l) symmetric, we can finally encode this new constraint by 
making use of U(1)-BM , modifying the bare kinetic term by introducing an auxiliary 

variable = (i„ - £^Aa - ^a^*) (i" - £5 A" - 9'^$) . 

(564) 

The following extensions of this working have been considered. 

1) Additionally, replacing by C^'^'^DhAaD^Ac in the action preserves the correct 
form to guarantee p = is maintained. We now have derivative coupling contributions also, 
so we need to make use of formula ii) of theorem 2 as well as formula i). Thus, weakly 

-^T^c ^ (^^2 |4^^ ^ l)TT''D''Aa + (4C2 - l)vr"D^A* + 4C3{N\^ DaA") 

2 

This has the same structure in Ai as for the GR case [the overall V~3 is unimportant, as 
is the replacement of the GR (pij — ^hij) factors by (pij) factors here], so an argument 
along the same lines as that used in RWR will hold, forcing the Gauss constraint and 
Ci = -C2 = -3, C3 = (Maxwell theory). 

2) The changes — > T^^ = h'^ {Aj-£^A}){Aji-£^Aji) , 

(566) 

UA _^ = OiKC'^'D.AlDdAf + B^jKC^'''''D,AiaAiAf + IjKLMC'''"'AiAfA^Af 

(567) 

(for a priori distinct supermetrics C, C, C) to the ansatz preserve the conformal proper- 
ties, hence guaranteeing that p = is maintained by the lapse- fixing equation obtained 
by applying (|566( I567|) to (|559|) . The new conjugate momenta are ()5()1|) . ^*-variation 
gives the secondary momentum constraint, ()5U3() and the local square root gives a primary 
Hamiltonian-type constraint, 

_Ai^C = ^ (ai? + U^iyi) -^iP°P+ -^^hl) = (568) 

in the distinguished representation. 

Using formulae i), ii) we read off that the propagation of the Hamiltonian constraint is, 
weakly, 

-AiT^c ^ f^^2 |(4C7^oiK ^ s^'^)n'lD'AKa + (4C20^^ - (5^^)7rfDa^K 

+AC30^'^4DaA'k - [DaTT'^A'^' - 2C"^^'^i?^K^Ia^f 

-20%,,Z)(rf|^K|o(2^iC^'''^ - AC'^'") - B^jKPijAfAf{2A\C''^''' - A>i&^''')]) . (569) 



In the same sense as for the single 1-form case above, (j569p has the same structure as for 
the GR case, so the argument used in 10 wih hold, forcing 



BiijK) = 0^Ci = -C2 = -^ (570) 



_g 

4 

(for some emergent coupling constant g) and leaving the new constraint 



Gj = DaTT^j - gBlJKVrJ^^" . (571) 

Again as for the single vector field case, the ttj ELE is unchanged from the GR case. The 
action of the dot on ^Ka gives no volume terms. Hence the working for the propagation of 
Qj is unchanged from that in ^JJI , which enforces 

I = = = 

-^JKLM = B JK^ILM , C2 = —Ci = — , C3 = , (572) 

-B^jK-BiLM + -B^JM^IKL + -B^JL-BiMK = (Jacobi identity) , (573) 

i?ljK = -B[ijK] (total antisymmetry) . (574) 

From (|570() and H573|) . it follows that the -Bijk are the structure constants of some Lie 
algebra, G. From ()574() and the Gell-Mann-Glashow theorem jl501l364j . G is the direct sum 
of compact simple and U(l) subalgebras, provided that the kinetic term is positive definite 
as assumed here. We can defend this assumption because we are working on a theory in 
which even the gravitational kinetic term is taken to be positive definite; positive-definite 
kinetic terms ease quantization. 

3) In the BFO formulation of the TSA, mass terms are banned by the propagation of 
the Gauss laws. Mass terms contain nontrivial powers of the volume; however the above 
arguments can easily be extended to accommodate them. In the many vector fields case, 
the effect of a mass term is to give rise to a new term ^^-M'^^Al^- in the ELE's, which 

contributes a term 2M'^^Di ^-^^k^ to the propagation of Gj. For this to vanish, either 
= which renders the vector theory trivial, or M"^^ = 0. 

App IV. A: Naive renormalizability 

I use the below as part of the justification of the scope of the matter terms considered in IV 
and VI. Feynman diagrams are schematic computational rules for each possible contribution 
to particle processes; path integral approaches are descended from such concepts. Fig 11 
has my propagator convention and depicts the terms used below. 

Now, for a boson-fermion theory with (/ + 6)-point interactions, the general Feynman 
diagram with n(/, h) (/ + 6)-point vertices, Fi internal fermion legs, F^ external fermion 
legs, Bi internal boson legs, -Be external boson legs, and L loops has the momentum space 
form 

a^y nE/.,n(/,fe) ^ .^.B, ^ ,^.F, ^ . 5 .functions due to 

{2tt)^ ) \p J V mass conservation 

This has a superficial degree of divergence = 4L — 2Bi — Fi. But by L = Bi + Fi, 
E/ Mf, b) = 2Fi + Fe and Zb bn{f, b) = 2Bi + Be, + b < 4 for any vertex type n{b, /), 
lest there be divergence. Also, observed lepton number conservation requires fermion legs 
to occur in pairs. 




Fi gure 111 The external legs represent the input and output of the particle physics process (scattering), which 
although practical, has Minkowsltian (open-universe) connotations if extrapolated. Internal legs and loops are also 
pictured. 

Thus the only possible type of interaction involving fermions is n(2, 1): the 
~ 99^ of Yukawa theory , and 

(p(pAof QED, QCD and Weinberg-Salam theory. 

For bosons alone, the permissible interactions are 

^^^^^,s'*of interacting scalar field theory, 





AdA, A of Yang-Mills theory. 




J 

sysA^ s*sA of U(l)- or Yang-Mills-scalar gauge theories. 



These are the set of theories of conventional particle physics (1.1.7), and are the 
ones I aim to couple to GR in the TSA in VI. 



App IV.B: Gell-Mann— Glashow theorem 

An algebra A is a vector space V equipped with a product □ : A x A — > A. A subalgebra B 
of A is a vector subspace Wof V which is also an algebra with product □. It is invariant if 
b n a & B y b £ B,\/ a £ A. A real representation F of A is a map F : A — > GL{n, SR). It is 
reducible if 3 some subspace [7 in V that is left invariant by F. If it is not reducible, it is 
irreducible. It is totally reducible if it can be written as a direct sum of reducibles. 

A Lie algebra G is Abelian \i \ [gi,g2\\ = V gi, 52 £ G. An Abelian algebra with 1 
generator is a U(l) algebra. G is simple if it contains no proper invariant subalgebras and 
it is not Abelian. G is semi-simple if it is the direct sum of simple Lie algebras. A simple 
or semisimple Lie algebra is compact if tr{TxT^) = /*^ad/^bc is positive-definite. 

Now for the Proof of the Gell-Mann-Glashow theorem 115(1] : 

H519|) =^ (|520|) trivially by basis change, provided that Qab is positive definite, in which 
case we can define {Q~'^)ab- 



In the basis assumed in (|52U|) are matrices of the form ('f^)BC = —if'BCA-'^'^ Because 
these are imaginary and antisymmetric, they are by definition Hermitian. Thus we can use 
the simple fact that Hermitian matrices are either irreducible or totally reducible. . In the 
irreducible case, by definition there is no proper W C V that is left invariant by the f^, 
ie there is no set of less than K linearly-independent vectors (t'/j)B such that T^(f_R)c is 
a linear combination of the (ijeJc- As (f^)BC oc /abc, this means that there is no set of 
linear combinations T/j = (fR)c^c ^^^^ closed under |[ , ]| with all the (t^)- such a 
set would provide the generators of an invariant subalgebra. Hence the absence of such a 
set means by definition that the Lie Algebra is simple. 

In the totally reducible case, then there must be a suitable choice of the (i^) that are 

block-diagonal supermatrices (tA)ME,NF = (i^^^^)EF<^MNi where the constituent r^*-^^ 
submatrices are either irreducible or vanish. Adopting this basis also for the Lie algebra 
gives structure constants of the form 

/lC,MA,NB = «('?Lc)mA,NB = i(TLc^'')AB<JMN (575) 

But, since / is totally antisymmetric, this is proportional also to (5ln and 5lm- So for any 
representation = tma of the Lie algebra in this basis, 

i[rr,rr]i=.<^MN/^;ikrr (576) 

for /q^b real and totally antisymmetric in CAB. This is what we mean by an algebra 

being a direct sum of subalgebras, and furthermore, for each M, is either irreducible 
(corresponding to a simple subalgebra), or zero (corresponding to a U(l) subalgebra). 

Finally, V ^;a G 3«^(M), Q^^ = -f^^USiW^B > 

(577) 

since by antisymmetry it is a sum of positive quantities. Furthermore, strict equality occurs 
only if t^A = 0. For suppose not. Then ua^^^ is an invariant Abelian subalgebra, which 
is a contradiction, since the form a simple Lie algebra. 

(|521j) means that in some basis = Cma,A''"a- To construct Qab, take Qma,nb = 

Q^^Smn, which is an arbitrary real symmetric positive-definite matrix since each 
is. Then (|519j) follows for each simple subalgebra, using first the Jacobi identity and then 
antisymmetry: 

QSJV(^)°AB=/^^^^AD/^^)°BE/^^^^CF-/^^)^D/(^)°BE/(^)^CA=-Qi3lV(^)°^ 

(578) 

This is also trivially true for the U(l) subalgebras since their structure constants vanish. 



Corollary: One can always define the scale of the gauge fields such that Qab = ^ab- 
proof : I [Qab ) ''"a] I = ^'^^ totally-antisymmetric structure constants. All the can 
be put into block diagonal form, with irreducible or zero submatrices along the diagonal. 
Then by Schur's Lemma, Qab must also be block-diagonal, with blocks of the same size 
and position as in the t^, and with the submatrix in each block being proportional to the 
unit matrix. So Qab has the form Qma,nb = gM<^MN^AB where the are arbitrary real 
positive numbers. Then we can always rescale the gauge fields so that these are all equal 
to 1. □ 

App IV. C: Teitelboim's inclusion of matter into the HKT 
route 

'^^A is for adjoint and R is for reducible. 



One wishes to include matter fields ^ in such a way that the resulting 7i and Tii close 
as the Dirac Algebra (|287|) . Teitelboim notes and uses that minimally-coupled scalars, 
electromagnetism and Yang-Mills theory have '^Ti ultralocal in hat- It so happens for all 
these examples that 

'^n = n + ^n and ^ni = ni + ^n^ (579) 

where ^"H and \Sj7ii are the matter contributions, and that ijjTC and ^iTCi separately obey 
the Dirac Algebra [3481 1351] . This is the result mentioned in IV. 1.2. 

The minimally-coupled scalar is trivially ultralocal. For a single 1-form to work in this 
way, it is enforced that -D^TTm is physically irrelevant, and thus one quickly arrives at 
electromagnetism. For many 1-forms, ultralocality and the inclusion of the single 1-form 
case enforces Qi = DaT^j + g^C^ijAfir^ to be physically irrelevant. Now, by Teitelboim's 
extra assumption that the Qi generate an internal symmetry, the Poisson bracket of two 
^I's must be a combination of these ^I's: {^(A^), ^(A^)} = Q{Q). This enforces both 

C^AB = -C^BA (580) 



and, via = C^^abA^ Af , 

(581) 

the Jacobi identity C^jkCilm+C^jmCikl+C^jlCimk = . 

(582) 

From the appearance of these, Teitelboim deduces that the theory has gauge symmetry. 
He argues this to be a consequence of embeddability. As for the TSA, I observe that the 
resulting Lagrangian contains a positive-definite combination of internal indices of Ff^ from 
which the GMG theorem tells me that the corresponding Lie algebra is furthermore a direct 
sum of U(l) and compact simple Lie subalgebras. 



V TSA: Discussion and interpreta- 
tion 

1 TSA versus the principles of relativity 

I take Wheeler's question about the form of the Hamiltonian constraint seriously. Starting 
from the relational 3-space ontology, the TSA gives Hamiltonian-type constraints 

'T^trial ^ ^(^^ + A + U„) - |y (^p o p - ^p2^ + GABn^n^l = (583) 

as identities from RI. Consistency alone then dictates what options are available for T^tfis-' - 
the Dirac approach. I have included matter fields ^'a since I have found that conclusions are 



best made only once this is done, ^'a is s.t T* is homogeneous quadratic in its velocities 
and U* at worst depends on connections (rather than their derivatives). I then get the 
following master equation for the propagation of the Hamiltonian-type constraint: 



H 



trial 



N 



i^a(D^Pai, + {X-\}Dap) 



+ 



au* 1 



2 dT-,j 



+GABn^ 



(584) 



The strategy tied to the Galilean RP2 for this is to declare that y = 0. This kills all 
but the last factor. It would then seem natural to take li^ = 0, whereupon the fields are not 
dynamical. They are however not trivial: they include fields obeying analogues of Poisson's 
law, or Ampere's, which are capable of governing a wide variety of complicated patterns. 
One would then have an entirely nondynamical 'Galilean' world. Although this possibility 
cannot be obtained from a BSW-type Lagrangian (the T factor is badly behaved), this 
limit is unproblematic in the Hamiltonian description. Of course, the Hamiltonian-type 
constraint ceases to be quadratic: 



n 



(Y = 0) 



o-i? + A + Uvi/ = . 



(585) 



Now one might still vary with respect to the metric, obtaining a multiplier equation in place 
of the ADM (or BSW) evolution equation, N{WR - K'^) = h'W^N - D'D^N. In vacuum 
the trace of this and H = R = leads to D^N = which in the absence of privileged vectors 
implies that N is independent of position so that clocks everywhere march in step. Then 
also Rij = 0. The cosmological constant alone cannot exist in an unfrozen CWB world. 
But the inclusion of matter generally breaks these results. One might well however not vary 
with respect to the metric and consider the worlds with a fixed spatial background metric. 
This includes as a particular case the Hamiltonian study of the flat spatial background 
world in the local square root version of App II. B, but permits generalization to curved 
backgrounds. 

The strategy tied to the CarroUian RP2 is to declare that a = 0. One still has the 

penultimate term so presumably one further declares that U'^ contains no connections (the 
possibility of connections is studied more satisfactorily in VI). It is 'natural' then to take 
the second factor of the last term to be thus obtaining a world governed by Carrollian 
relativity. 

The strategy tied to the Lorentzian RP2L is somewhat more colourful. Use = — 1+1, 
reorder and invent a momentum constraint: 



^trial^ 



2D° 



1 



H 



+- 



1 



H' 



~5f 



-GabH 



+YaiX - l)D.p + Y - f (|L,e. _ 1 I) . (536) 

Now go for the orthodox general covariance option: that the third and fourth terms cancel, 
enforcing the null cone. This needs to be accompanied by doing something about the fifth 
term. One can furthermore opt for the orthodox X = 1: the recovery of embeddability into 
spacetime corresponding to GR (RWR result) , or for the preferred-slicing but York-GR-like 
worlds of DaP = 0. Either will do: the recovery of locally-Lorentzian physics does not 
happen for generally-covariant theories alone! One requires also to get rid of the connection 
terms but the Dirac procedure happens to do this automatically for our big ansatze. Thus 
GR spacetime arises alongside preferred slicing, Carrollian and Galilean worlds, in which 
aspects of GR-like spacetime structure are not recovered. 



With the above in mind, a clarification is required as regards the previous use of ex- 
haustive proofs. The ultralocal and nondynamical strategies for deahng with the last term 
of ()584() are available in all the above options. It may not shock the reader that degener- 
ate and dual-degenerate possibilities might coexist. Indeed Carroll matter in the Galilean 
option permits a BSW Lagrangian... But in the Lorentzian case this means RPl is not 
fully replaced! At the moment, we do derive that gravitation enforces a unique finite 
propagation speed, but the possibility of fields with infinite and zero propagation speeds 
is not precluded.''^ Thus the objection that Newtonian mechanics and electromagnetism 
have different relativities is precisely not being countered! So in this approach, if one were 
to observe an analogue of electrostatics (a Poisson law), or of magnetostatics, one could 
not infer that there is a missing displacement current (or any other appropriate individual 
'Lorentzifications' of electrostatics and magnetostatics in the absence of a good reason such 
as Faraday's Law to believe in unifying these two analogue theories). One would suspect 
that formulating physics in this way would open the door to analogue Aethers coexisting 
in a universe with Einstein's equations. 

In more detail, BFO dismissed this possibility as trivial from counting arguments. But 
these are generally misleading, since they do not take into account the geometry of the 
restrictions on the solution space. It is true that if there are more conditions than degrees 
of freedom then there is typically no solution, but some such systems will nevertheless have 
undersized and not empty solution spaces. 

As a first example of this, consider the flat spacetime single 1-form case of App II. B. 
The crucial term is then (1 — C)'KiF^K The C = 1 option gives the universal light-cone, 
but the other factors could be zero in a variety of situations: they mean a vanishing Poynt- 
ing vector: E x B = 0. This includes E = (a fragment of magnetostatics), B = (a 
fragment of electrostatics) and E || B- Each of these cases admits a number of solutions. 
These include complicated patterns analogous to those which can occur in electrostatics 
and magnetostatics, which could not be described as trivial. 

As a second example, consider the single 1-form in homogeneous curved spacetimes. 
TT = E = imposes a severe but not total restriction on Minisuperspace. The Bianchi 
types IV, y, VI (h / —1), VII (h = 0), VIII, IX are banned outright, whereas the flelds 
in Bianchi types II, VI (h = —1), VII (h ^ 0) have less degrees of freedom than expected 
pointwise in Einstein-Maxwell theory.''^ Nevertheless, solutions exist (see p 202 of |223j ). 
The treatment of B = is identical to that of E = by dual rotation. E || B also 
admits nontrivial solutions such as the charged Taub metric, or its generalization on p 
195 of j223j . These are not trivial models. Thus one has indeed an undersized but still 
interesting solution space. Now, these solutions could all be interpreted as belonging not 
just to Einstein-Maxwell theory, but also to a theory T with Einstein cones and distinct 
(even degenerate) cones belonging to a strange 1-form theory. 

However, despite these examples illustrating non-triviality, RPl is safe. For, the theory 
T permits no macroscopic 1-form propagations, since Ei = means no momentum, Bi = 
means the theory is ultralocal so 1-form information does not propagate away from any 
point, and Ei \\ Bi means that there is none of the mutual orthogonality that ensures the 
continued propagation of light in electromagnetism. In the absence of such propagation, 
the concept of a 1-form particle moving in a background solution of theory T makes no 
sense (since this is but an approximation to the field equations of theory T, which permit 
no 1-form propagation). Thus such a 1-form is causally irrelevant, so the recovery of RPl 
from the TSA is not affected. 

Moreover, one does have a source of potentially nontrivial scenarios from this insight: 

^^The latter is the analogue of the 'each particle moving in its own potential' bad-ordering mechanics 
example in II. 2. 2.1. 

^^h is a further invariant for types VI and VII given by (1 — h)i^BAi'^CD = — 2hL'^DBi'^Ac for I/^bc 
the structure constants of each Bianchi model's associated Lie Algebra. 



such nonpropagating Carroll or non-(c = 1) Lorentz or Galileo fields could nevertheless 
be coupled via potential terms to propagating fields, leading to scattering of the propa- 
gating fields. Whether such unusual fields are capable of producing interesting theoretical 
cosmology results may deserve further investigation. 

2 Discussion of conformal theories 
2.1 Discussion from ABFO paper 

The conformal branch of the TSA provides examples of theories of evolving 3-geometries 
that do not fit together to form spacetime. Nevertheless, the classical bosonic physics 
corresponding to these theories has the standard locally Lorcntzian form. Thus the universal 
null cone can appear alongside preferred slicing rather than cmbcddability. 

The differences between the conformal-gravity-matter and GR-matter metric ELE's are 
small: the absence of a term containing the 'expansion of the universe' p and the presence 
of a global term such as 



where scalar matter has been included. Such a global term mimics the effect of a small 
epoch-dependent cosmological constant. This global term is a 'cosmological force' because 
it occurs in the ELE's with proportionality to /i"^, just like the cosmological constant 
contribution docs in GR. We expect it to be epoch-dependent because it contains matter field 
contributions, which will change as the universe evolves. The occurrence of this global term 
should be compared with the scale-invariant particle model, in which there is a universal 
cosmological force induced by all the familiar forces of nature such as Newtonian gravity 
and electrostatics. There, this cosmological force is extremely weak over solar system scales 
but has a decisive effect on cosmological scales, ensuring the conservation of the moment of 
inertia. Our global term is an action-at-a-distance term which is the price to pay for our 
particular implementation of scale invariance. Some of our other conformal theories do not 
have the global features; correspondingly they allow for the volume of the universe to be a 
meaningful concept. Thus these other theories are closer to GR than conformal gravity in 
this respect. 

Conformal gravity represents a new approach to scale invariance. Its construction shows 

how BM and constraint propagation are powerful tools for a different way of constructing 
theories. Conformal gravity also highlights the thought-provoking manner in which GR 
only just fails to be fully scale invariant. 

2.1.1 On the weak field limit 

We expect that conformal gravity will pass the the solar-system tests just as GR does. This 
is because, first, the expansion of the universe does not play a role on such small scales in 
GR so its absence will not affect the results. Second, at maximal expansion, a data set may 
be evolved by both the GR and conformal gravity equations. The difference between these 
two evolutions is well defined in Riem. Since the first derivatives match up at maximal 
expansion, the difference between the evolutions is small. For sure, the size of the difference 
will depend on the global terms. But these can be made small by a well-known construction 
as far as the finite-time evolution for a patch of initial data that is substantially smaller 
than the radius of the universe is concerned. Such patches can be constructed to contain 
a simple model solar system: a patch of spherically-symmetric weak field regime on which 
test particles and test light-rays move. I have found nothing to date to contradict this 
assertion, briefly noting that the conformal gravity system is the same as the ADM one to 
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first order. I have not pursued this very far because the currently unresolved cosmological 
difficulties below stand out as stronger grounds on which to question conformal gravity. 
Binary pulsar calculations, which in GR involve the second-order Ricci tensor [6<5'6\ . may be 
harder to check than solar system tests. Additionally, in GR these are based on a tensors 
on Minkowski spacetime approximation, which may not be applicable to conformal gravity. 
Finally, conformal gravity is the extreme in difference from GR of the conformal theories. 
The closer to GR, the less doubt there is about the reproduction of GR results. 

2.1.2 Cosmology 

On account of the strong evidence from the Hubble redshift, nucleosynthesis and the mi- 
crowave background it does seem unlikely that conformal gravity will be able to supplant 
the Big Bang cosmology. Crudely, as conformal gravity currently stands there is none of the 
first and the grounds on which a standard 'hot' explanation of the other two would be based 
are questionable (although Barbour illustrates some simple points in So when writing 

ABFO, we considered rather the potential theoretical value of conformal gravity as a foil 
to the Big Bang. Theorists concerned with achieving the deepest possible understanding of 
cosmology and the foundations of physics value alternative models,'''^ even if they explain 
or mimic only part of the whole picture. This is akin to the role of Brans-Dicke theory in 
solar-system tests of GR. 

Conformal gravity may require a more sophisticated approach to cosmology. Consider 
the Friedmann-Lemaitre-Robertson- Walker (FLRW) homogeneous and isotropic cosmolo- 
gies, which are the backbone of standard cosmology. As solutions in which nothing changes 
except size and homogeneous intensity, they are suspiciously trivial from a dynamical view- 
point. In a scale-invariant theory, the FLRW-type solutions are merely static points in 
the configuration space. There has long been concern j224j about the accuracy with which 
FLRW cosmologies approximate more physically realistic inhomogeneous solutions of GR 
under the assumption that it is the correct physical theory. Conformal gravity raises a 
more serious doubt: it gives the possibility of physics and small-scale gravity that is in 
close agreement with GR but which differs greatly from GR cosmologically (and quantum- 
mechanically) . In comparison, the Brans-Dicke/dilatonic modifications of GR 367,. 68] 
have not been found to significantly affect the key physical basis of Big Bang cosmology. 

Since conformal gravity has no dynamics analogous to the FLRW universes of GR, the 
only possible direct progress in its cosmology would be through the study of anisotropic 
and especially inhomogeneous solutions. This is the opposite emphasis to the norm in 
classical and quantum cosmology. In ABFO this was said to "have some chance to throw 
up a radical new explanation of the redshift" This was based on it being known that 
in GR, in addition to Hubble redshift, a change in dumpiness (shape) of the universe can 
cause redshift. The solar photons that reach us are redshifted by having to climb out 
of the solar gravitational potential well (gravitational redshift), and inhomogeneities cause 
similar effects in cosmology (the integrated Sachs-Wolfe and Rees-Sciama effects |2851l3U8j ). 
The scale-invariant particle model is suggestive in this respect. We speculated that the 
rearrangement of geometry and matter of an evolving universe can cause a similar redshift 
in conformal gravity. In such a case, it will not be due to differences in the gravitational 
potential between different points of space but between different epochs. Now, the potential 
can be changed either by a change of scale or by a change of shape. In conformal gravity, 
the former is not available and so the latter would have to be the origin of the observed 
cosmological redshift. Since the change of shape of the universe can be observed, this should 

^^For example, in [l^Mj an eternal singularity is presented as an alternative explanation for the isotropy 
of the microwave background; in |268| it is shown that an anisotropic, Bianchi V universe can account for 
the correct light element abundances. 

^^This is not the only way out of difficulty though. I explain in the next subsection why I currently rather 
favour more conventional explanations. 



lead to testable predictions. 

Conformal gravity may also offer a different perspective on singularities. The Big Bang 
itself is an initial singularity where the known laws of physics break down. It is inevitable 
in GR by theorems of Hawking |175j . These require the expansion of past-directed normal 
timelike geodesic congruences to be positive everywhere on a given spatial hypersurface. 
The GR form of these theorems will not hold in conformal gravity since such a notion of 
expansion is no longer meaningful.''^ In GR, the Hubble redshift interpretation forces one 
to admit the breakdown of known physics in our finite past, whilst in conformal gravity, 
the denial of such a breakdown requires a new interpretation of the Hubble redshift. 

Whereas our greatest interest is in whether conformal gravity can give us an alternative 
cosmology, our CS+V theory has a notion of universal expansion, so it will be much closer to 
GR both in agreeing with the standard cosmology and in not offering these new perspectives 
on nonsingularity and global cosmological forces. I provide more cosmological ideas in 
V.2.3.2. 

2.1.3 Brief quantum outline 

We finish with a simple discussion toward quantization of conformal gravity |1H [2] . We 
wonder about what role TCc now plays. This and the fundamental LFE H432|) are nonstan- 
dard objects from the quantization perspective. The new global terms may also play a role. 
Whereas in GR the DeWitt supermetric gives an indefinite i.p as a consequence of the sign 
of the expansion contribution to the kinetic energy, in conformal gravity the new W = 
supermetric gives a positive-definite i.p, altering the status of the i.p Problem. Our quan- 
tum program is attractive in that conformal gravity has a marginally smaller configuration 
space than GR (rather than some choice of additional structures). Note that these features 
are widespread throughout the various conformal theories. 

The study of some of the novel features of quantum conformal gravity can be isolated 
by the sequential study of W = strong gravity, strong conformal gravity and conformal 
gravity. The effect of using a positive-definite W = supermetric can be tried out in 
W = strong gravity. Then the additional effect of introducing a volume and of the role 
of Ti'^ can be tried out in strong conformal gravity, while the additional conformal gravity 
complication of a nontrivial integro-differential LFE is absent here, from my observation 
that its counterpart here leads to being a spatial constant. 

We hope to use a 'top-down' approach. However, we start from space rather than 
spacetime for relational reasons [37 , 30 , 34, .38^ JS| and to illustrate that it is potentially 
misguiding to always presuppose and generalize spacetime structure. We hope to quantize 
in the timeless naive Schrodinger interpretation favoured by Barbour |3U1 1311 

The Problems of quantizing gravity are hopelessly interrelated, so that adding to a par- 
tial resolution to tackle further Problems can spoil that partial resolution |237j . III. 1.5 
contains a further example of this malady: it is not to be expected that Ashtekar variable 
techniques , with their simplification of operator ordering and their natural regulariza- 
tion, could be imported into conformal gravity. Thus, quantization of conformal gravity 
will differ from, but not necessarily be easier than, quantization of GR. Should conformal 
gravity adequately describe the classical universe, its quantization program will become 
more important. Even if this were not the case, we expect to further the understanding of 
quantization and of quantum GR by the study of such theories as toy models. 



We do not know if other forms of singularity theorem hold. We cannot so easily dismiss results involving 
null and/or local expansion. Another source of trouble in adapting GR proofs for conformal gravity will be 
the lack of a 4-d generally covariant equivalent to the EFE's. If local singularities form, they will contribute 
to the global terms in conformal gravity everywhere. This could be fatal in the particle model, but in 
conformal gravity there may be two ways out. First, singularities may only contribute a finite amount once 
integrated. Second, there may be a tendency to preclude singularities in our conformal theories by the 
'collapse of the lapse' becoming more than a gauge effect (see V.2.3.3). 



2.2 Further conformal alternatives 



2.2.1 First-principles formulation of CS+V theory 



Here are some well-founded candidates for CS+V theory. These particular candidates use a 
'Laplacian implementation' of volume-preserving conformal transformations (VPConf) like 
in that in III. 2. 6, but are built adhering to various more clear-cut first principles. Note 
that we are also in the process of considering alternative implementations |12j . The finite 
Laplacian implementation of the VPConf transformations in III. 2. 6, 

hab hab = (1 + D^C)hab , (588) 

is unsatisfactory, for 6 Murchadha and I independently found that these do not close to 
form a group. Rather, 

hab ^hab^hab=\\^ D\i + ^ ' t hab 589 

in terms of the Laplacian corresponding to the original metric, which is not of the form 
(1 + D'^'E)3hab for some H(^i,^2)- This is related to not being a conformally-covariant 
object. Moreover, there are no simple conformally-covariant differential operators acting on 
scalars that are plausible here. 

But (as presented in II), I realized that the form of BM is dictated by the infinitesimal 
transformations associated with the generators. Now, the 

~ 2 

clearly VP infinitesimal version of ()588() hab — hab = {^+-^D'^Ohab 

(590) 



do close to form a group: hab ^ Kb Kb ^ (^^ + 3^^*^^^ ^^''^^j ' 

(591) 

Thus one should rather use this infinitesimal Laplacian implementation, which in fact reverts 
to earlier drafts of the paper jllj . 

I want an action such that part of the variation with respect to auxiliaries gives the 
CMC condition = C(A) and another part gives the correct LFE to propagate this. I 
offer two approaches to this. Both follow from my arbitrary frame principle. The difference 
between them arises from there being two different kinds of covariance. 

The 'York style' is to write the action in the arbitrary conformal frame out of good con- 
formally covariant objects, bearing in mind that the theory only has a temporary technically- 
convenient conformal gauge symmetry (with respect to the bona fide metric scale-factor ijj), 
since the Lichnerowicz equation then gauge-fixes ip by specifically mapping to a particular 
point on the Conf orbit. ^'^ Consequently there is no Conf BM in this approach: conformally- 
bare velocities are to be regarded as conformally covariant. This conformal mathematics 
underlies both York's work and our attempt to write down a CS+V theory in III. 2. 6, and 
explains the underlying technical similarities. In this approach, what might have been re- 
garded as the trace and tracefree parts of a single tensor are rather regarded as distinct 
objects which are alloted distinct conformal rank as befits the formation of conformally- 
covariant derivatives. In particular, a relative scaling of the u"^ o u"^ and u"^ of arises. 

^''Thus this use of gauge theory is different from the U(l), Yang-MiUs and Diff uses of gauge theory. 
There, the choice of gauge is unphysical, whereas here it is physical because it is prescribed by an additional 
condition required for passage to the physical 3-metric. 



This corrects the naive mismatch in amount of Np^ between the auxihary variation LFE 
and the LFE required to propagate the CMC condition. In this approach, there are 
two distinct auxiharies, and the one encoding the CMC condition {rj) should probably be 
regarded as a multiplier, not a best-matching. The action is 



\ = i d\i SxVh { 1 + ^) J (r - ^\ Ju^ ou^-^il- 2AC) {n + Arjf 



This is (the infinitesimal version of) what was considered in III. 2. 6. 
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The 'Barbour style' is to consider an action with a true conformal symmetry, with scale 
factor uj. Then (j) is neither the symmetry scale factor nor a bona fide metric scale-factor: it 
is an extraneous auxiliary whose role is to make the conformal symmetry true via its banal 
transformation property. Then VPConf-BM is indeed required^^ to make hij into a good 
VPConf object. The natural action is then 



dA j d^x^/h^^J^'"^ (^R 



with 



1 + infinitesimal, so that 



40 
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dA I d^xVh(^i + '^^ Mr 



4A^C 



'u^ ou^ [ u + 2 



. g(AC) 
dX 



(594) 



The momenta are 



(595) 



for p^^ the GR expression for momentum, and 

Then free-endpoint variation and the trace of ()595() give 



P 
Vh 



C{X) , p = p(l + AC) i.e. p 



(596) 



(597) 



so that the naive p is replaced by the scaled- up p. 

Note that (|594|) does not have a relative scaling of (j)^"^ between o and u^. Before, 
the exclusion of this led to a mismatch in the Np'^ terms between the (^-variation and 

= C(A) propagation LFE's, so one might expect (|594j) to be inconsistent. However, it 
turns out that (^-variation of (|594j) has a new source of Np'^ terms. 
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by the W = Y = 1 cases of (|344|) and (|346|) . This term occurs with cofactor numerically 
proportional to and as this is constant, it can be taken inside A, and happens to 
contribute just the right amount of Np'^ to the (^-variation LFE to obtain the CMC LFE. 
Note also that a '(/>^^' relative scaling emerges in the primary constraint arising from 



use u = tr{Q^hij) and 



^^It is in fact only a nontrivial correction to it since hj^ 



squaring the momenta, due to the nonminimal couphng: 

,2 J2 



G.^MP'^p'^ =p^op^-?-=p^op^-P-il + 2 AC) . (599) 

DO 



This is r{R- ^) = pTopT_ ^12P' 
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which may be identified with the Lichnerowicz-York equation. The above working could 
be seen as an alternative derivation of this equation. 

Next, note that we came across the above via setting the D^iNDap) factor in (1^811) to 
0, but they are also resliceable since = 1 i.e the constraint algebra works just as well 
if CMC slices are not chosen. Thus I identify the above as not being alternative theories 
but new formulation of CWB GR in the CMC gauge. However there is no longer any 
consistency reason for setting W = 1. So I can consider the arbitrary- versions of the 
two approaches above in order to obtain alternative theories of gravity which have genuine 
privileged slicings. These follow from replacing the — |'s in the actions by ^— f^- Thus, if 
one takes York's IVP mathematics more seriously than GR itself, then one is entitled to 
consider a range of privileged-slicing theories in addition to resliceable GR. 

2.2.2 Recovery of conformal gravity 

The single-auxiliary formulation of conformal gravity is clearly in the 'Barbour style'. The 
2-auxiliary formulation is not quite in the 'York style', but can be easily made to comply 
with it since the scaling of is rendered irrelevant for conformal gravity because p = 
arises. The action is then 



fd'xVhcpKU-^ (r - JuT ouT-{W-l){n + Of ^ 
d\ 1 — . (601) 

1/3 



Note also how W = \ (not W = now clearly gives the simplest presentation (but this is 
the intractable degenerate case). 

2.2.3 Further alternative conformal theories 

The survey of alternative theories in III.2.6 is not exhaustive. First, recall that the trick 
of dividing through by the volume was adopted in the maximal CWB case to allow the 
LFE to work. However, there is nothing stopping one also trying to divide through by the 
volume in the CMC CWB case, even though the LFE is already well-behaved without this 
correction. This gives Kelleher's theory |212l 121 J] . 

I = / dA ^- — . (602) 



This turns out to be restrictive: ry = fuiiy constant , 

(603) 

is enforced giving an interesting (almost certainly too) restrictive cosmological scheme (see 
V.2.3.2). 

Another alternative theory is my 



y dA y d=^xV)I</.Mi?- y^/To^T _ 2^-12 2)2^)2 (Q04) 



for cj) varied standardly rather than freely flapping. Then consistency requires the York time 
to have the freedoms of Newtonian absolute time: 



TY = C\ + D : C , D fully constant 



(605) 



from = F{X alone) and F{X) 



d_ f p \ 
dX \VhJ 



Q{x alone) 



where the last step is by the auxiliary variation. Hence F = Q, a total constant (and 
C = ^). Thus I name this theory Newton-York absolute time theory. It is intermediate in 
restrictiveness between GR and Kelleher's theory. I found that all the other combinations 
of types of auxiliary did not yield any further distinct consistent theories. 

Note that all bar CS+V and CG are 2-auxiliary 'York style' theories, with their (/>^^ 
relative scalings. I am not sure whether the choice of implementation (out of the above two, 
the Laplacian one, or the one in '12 ) affects precisely which 'CS+V configuration space is 
involved. We consider this well worth investigating, given the connection between CS+V 
and a representation of the CWB GR d.o.f 's. 

2.3 Discussion and interpretation 

2.3.1 Conformal gravity and CS+V theory as PDE systems 
2.3.1.1 Traditional thin sandwiches 



For conformal gravity, there is no Problem of zeros globally for all of its solutions in the 
classic thin sandwich approach since the potential is of fixed sign. The impasse may then 
be instead whether such fixed sign potentials are capable of describing moderately compli- 
cated astrophysics. For 'York-style' CS+V theories, differences in W should again alter the 
behaviour of the thin sandwich formulation. For both conformal gravity and the CS+V 
theories, the auxiliary and its A-derivative (or the second auxiliary) become involved so 
that one has not the usual sort of 3-equation thin sandwich system, but rather an extended 
5-equation system. 

Here is the conformal gravity vacuum traditional thin sandwich system for ^, (p and (p: 
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The Barbour-style CS+V traditional thin sandwich system for ^, C, and C, is messy once 
I substitute for A^(^, C, C) so I do not give it exphcitly. One feature is that the 3-vector 
equations are now fifth order whereas the LFE is sixth order, both due to the D'^C, in 
the potential. The W = 1 case is an as-yet untested scheme for a (piece of CWB) GR. 
Were it to have good p.d.e properties, it would be of considerable direct interest both in 
numerical relativity and in quantization. But it is plausible it will pick up difficulties from 
the traditional thin sandwich subsystem it includes. However, allowing for other values of 
W gives elliptic thin sandwich operators (see C.3). Thus it may be so it may be that some 
working (toy?) worlds emerge from such a study, in which some of Wheeler's old classical 
and quantum hopes could be tried out. 



2.3.1.2 Conformal thin sandwiches 



I next consider the 'York style' W = 1 CS+V conformal thin sandwich. This is messy 
if written using 1 + D^d^ for (p, so I keep this as a later admissibility restriction to be 
imposed on the (/> solutions. Collecting together the W = 1 CS+V LFE, Dip^^ = and the 
'VF = 1 CS+V Lichnerowicz-York' equation, the conventional GR conformal thin sandwich 
of 1.2.9.4.1 arises once the equations are re-expressed in terms of ay (s.t ay = (j)^ay rather 
than N = (p'^N), and the Lichnerowicz-York equation along with the product rule is used 
in the LFE to eliminate D'^cp. 

Next I consider the conformal gravity conformal thin sandwich formulation. Collect the 
conformal gravity LFE, Dip^^ = and the 'conformal gravity Lichnerowicz equation' (|434)1 . 
write in terms of ay, and use the product rule along with (|434l) to eliminate D'^cp in the 
LFE, thus obtaining: 

-cp^D'^ay - Ucp^da^d'^ay - 42da(pd''(pay 



+^K^ oK^ -^^"^ayR = V-3(j)^ [ Vhd^xayK^ oK^ , (608) 
40" 4 J 

a(^[(|W-AT^^]) = 0, (609) 
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This differs from GR's conformal thin sandwich and is thus a starting-point for confor- 
mal gravity predictions to compete with GR's as regards colliding compact objects. Note 
however that some things will be shared: for moment of time-symmetry data (see App C, 
1.2.11), GR and the above are identical. Just as well numerical relativity is developing away 
from such overly simple cases! Note that one can also build arbitrary-VF CS+V theory 
conformal thin sandwiches as a testbed for numerical relativity. 

2.3.2.3 IVP-CP formulation 



For conformal gravity, one would first solve the 4 IVP equations for (p and the longitudinal 
vector potential Wi, then redeclare (p and hij so that cp = 1 and then tackle the evolution 
problem in the ^-distinguished representation. As the IVP's are identical in all relevant 



ways to the GR IVP, they are therefore as well-posed. One then faces a CP step coupled 
to the solution of the LFE. As a procedure this is not novel: its analogue in GR is the 
maintenance of a (partial) gauge- fixing in the to study of the evolution equations. The 
LFE's in question here are also technically close to the usual ones in theoretical numerics. 
The novelty is in some of LFE's being enforced. 

It may be problematic that GR's p.d.e theorems are strictly tied to the harmonic gauge, 
which may be of limited computational value. This becomes harsher for the above conformal 
theories since now there is no apparent right to use harmonic coordinates, so there are no 
known theorems at all. 



2.3.2 Cosmology 



Whereas inhomogeneity can produce redshift, it is unlikely to produce anywhere near as 
much redshift (around 6) as is required to account for observations of distant galaxies. 
The standard cosmology furthermore makes use of much larger redshifts than this so as 
to explain features believed to be of earlier^^ origin than galactic structures, such as the 
microwave background or the abundances of the light elements. The GR cosmological effects 
mentioned above are capable of producing fluctuations in redshift but are small compared 
to the Hubble redshift. It is also worth noting that in GR the maximum gravitational 
redshift from light emerging from a star is 2 |363j . It remains to be checked whether these 
results also occur within conformal gravity. Finally if one were to attempt to explain larger 
redshifts cumulatively from clumpy effects, one would be faced with unobserved anisotropy 
correlated with the presence of intervening galactic structures. 

I rather propose some simpler, more conventional solutions to this cosmological difficulty. 
First, note that if the universe is described by one of our conformal theories, it need not 
be conformal gravity. The CS-I-V theories permit the standard GR cosmology if we so wish 
(though one might choose an appreciably different W). This might be regarded as being too 
similar to GR to generate interest as an alternative cosmology. However, there are theories 
of 'intermediate restrictiveness' between conformal gravity (no standard cosmology) and the 
CS-I-V theories (roughly the same as the standard cosmology). These examples do achieve 
some of the goals of alternative cosmologies beyond the illustrative value of the nonexistence 
of standard cosmology in conformal gravity as originally conceived. 

For example, in Kelleher's theory |211j . redshift is clearly available (p = fully constant 
rather than conformal gravity's p = 0), and its form is most rigid compared to GR (where 
p can vary with cosmic time.). Furthermore, the isotropic cosmology ansatz rigidly leads 
to a scale-factor a = ti independently of the matter content: 

totally constant = —= = V'^— cc a — . (611) 
^/h a dt ^ ^ 

Both the rigidity of this form and its insensitivity to the matter are completely unlike 
GR. From a GR perspective this looks like a flat universe with a peculiar but physically- 
admissible equation of state, and yet in truth it is a closed universe corresponding to a 
conformal theory. Of course, a = does not fit current observations well, and corresponds 
to a universe believed to be younger than some of its constituent stars! Nevertheless, this 
example deserves further investigation to delineate which of its predictions agree with GR 
and which differ. It may be seen as too rigid to be realistic. To this I offer three ways out 
(some of which may be combined), which I leave as a future project. 

First, there is no guarantee that this a = is a stable attr actor of inhomogeneous 
solutions. It may be that the theory permits other more typical asymptotic behaviours to 
which there corresponds no exact solution. This would furbish more interesting alternative 
cosmology examples. 

^■^In the case of element abundances, the observed abundance is explained both in terms of early-universe 
nucleosynthesis and the more contemporary influence of astrophysics. 



Second, Newton- York absolute time theory is somewhat less rigid. Had it been possible 
to include volume division into this theory, the subsequent cosmology would have a dust-like 
a = 1 2 . Unfortunately, the combination of this theory with volume division turns out to be 
Kelleher's theory again, because volume division interferes with the viability of the CMC 
bypass of the integral inconsistency (see C.l). I am yet to investigate the cosmology of 
Newton- York absolute time theory itself. 

Third, the means of coupling the matter may not yet have been determined. Note that 
the couplings in j211j . and jl98j are all different. In fact, if I Conf-BM the matter in 
conformal gravity, I get a Hubble redshift: for each species A, replacing with 

Are there rigid rules determining matter coupling or is it down to taste? Isenberg-6 
Murchadha-York type scalings (198j in a 'Barbour style' formulation would give Hubble 
expansion, furbishing conformal gravity with a more conventional cosmology. The form of 
the matter coupling may also change the rigidities of the resulting cosmologies in Kelleher 
and Newton- York absolute time theories. 



2.3.3 CS+V theory: interpretation and possible tests 

I arrive at arbitrary-VF 'York-style' conformal theories since embeddability is no longer 
necessary for consistency. But then W = 1 and W ^ 1 have very different interpretations. 
For, whereas for W = 1 the theory can be resliced away from the CMC stacking and the 
constraints still close for embeddability, this is not true for W ^ 1. 

For the W = 1 CS-I-V theory, the use of a CMC stack of hypersurfaces is thus ultimately 
a gauge choice, which is available provided that the LFE is soluble. It is a partial gauge 
choice since the point-identification (shift) between hypersurfaces in the stack is still un- 
specified. That the LFE encodes this gauge choice means that one is automatically provided 
with a partially gauge-Exed action. Any pathology in this CMC gauge might then go away 
under the valid procedure of reslicing so as to be in another gauge. 

But W 1 CS-I-V theories are not just written to favour a particular slicing or possess 
a privileged slicing. They are not generally resliceable because this leads to inconsistency. 
Thus these describe stacks of CMC slices and not pieces of GR-like spacetime. As a result 
of this, pathologies of the stack of CMC slices become real effects since reslicing to avoid 
these is not possible. 

Thus while W = 1 CS+V theory is just (a restriction of) CMC-shced GR, / 1 CS+V 
theories are quite distinct at a conceptual level. 

To complete the picture, the choice oi W = 1 slicing is available in conformal gravity, 
so one can use it to pass to other slicings by embeddability. But unlike in CS+V, these 
new slicings remember the privileged p = slicing, since the volume of these slices gets 
incorporated into the field equations. 

Not being able to reslice leads to differences, at least in principle. Suppose one has access 
to a compact object whose curvature profile permits (GR-inspired) collapse of the lapse to 
occur well outside its horizon. Then in a GR world, one could send an observer past where 
the lapse collapses, and as nothing physical occurs there and the observer is still safely away 
from the horizon, the observer can 'return to Earth' and report that W ^ 1 CS+V theory 
has been falsified. But in a 7^ 1 CS+V theory world, the observer would have become 
frozen forever where the lapse collapses and thus would not be able to return. Note that 
this is somewhat similar to the frozen star concept which predated the GR notion of black 
holes, except that the freezing could be occurring outside the horizon. Although W ^ 1 
CS+V theory could therefore be an improvement as regards strong cosmic censorship (the 



occurrence of singularities at all), there is also the GR- inspired possibility in the Eardley- 
Smarr example: that sufficiently steep curvature profiles generate too slow a collapse of the 
lapse to avoid singularities. Also, the collapse of the lapse would not save one from other 
non-curvature blowup pathologies usually regarded as singularities. 

W ^ 1 CS+V theory ought to also be testable much as Brans-Dicke theory is, by solar 
system tests. While plain 'arbitrary-VK GR' was suggested as another useful testbed for GR 
|215j (c.f the use of Brans-Dicke theory), this is not much good because it's inconsistent 
(by jl56j or the RWR result). What I have demonstrated however is that the idea of this 
'arbitrary-W GR' can be salvaged because it is a consistent theory provided that it is treated 
as a (non-reshceable) stack of CMC hypersurfaces, in which case it becomes CS+V theory. 
Thus I provide a 1-parameter family of theories to test against (not just extreme but also) 
everyday GR. Moreover, by the nature of the conformal mathematics in which they are 
so naturally expressed, they should be easily useable as testbeds for theoretical numerical 
relativity. This should require but minor modifications of existing codes. Conformal gravity 
could also be used / tested in this way. Numerical relativity uses conformal mathematics, not 
necessarily any notion of embeddability into GR-like spacetime. The alternative theories 
of this section may be seen as arising from taking this conformal mathematics in its own 
right as possibly a serious alternative to GR itself. Finally, W = 1 CS+V theory may be 
seen as a formulation of GR proper, and thus still be directly useful (both conceptually and 
as a tool) in theoretical numerical relativity even if the suggested alternatives to GR are 
dismissed or heavily bounded by future compact-object observations and analysis. 



VI TSA: criticism 



I sharpen the understanding of what the TSA is because of interest in why the impressive 
coUection of results in the GR case above arises in BFO and AB. I seek for tacit simpUcity 
postulates, survey which assumptions may be weakened and assess the thoroughness and 
plausibility of BFO and AB's principles, results and conjectures. I thus arrive at a number 
of possible TSA variations. I stress that this is not just about improving the axiomatization. 
One must be able to find a version that naturally accommodates spin-^ fermions coupled 
1) to GR if the TSA is to provide a set of plausible first principles for GR 2) To conformal 
gravity if this is to be a viable alternative. 

In VI. 1, I argue that the use of BSW-type actions in the implementation of R2 is prob- 
lematic. First, Barbour's use of it draws inspiration from the analogy between it and the 
Jacobi formulation of mechanics. But in VI. 1.1 I point out that the Jacobi formulation 
itself has limitations and a significant generalization. Furthermore in VI. 1.2 I point out 
that the analogy itself is inexact in several important respects. There is then a 'conformal 
Problem' and a 'notion of distance Problem' (VI. 1.3). Second, should the notion of 'BSW- 
type theories' not include all the theories that permit the BSW elimination process itself? 
But when I sketch this to include fermions in VI. 1.4, I find that not the BSW form but 
rather its significant generalization is obtained. Thus the inclusion of fermions will severely 
complicate the use of exhaustive proofs such as those in BFO and AB. Another example of 
such complications is how particles are to be included alongside fields in RWR. I further- 
more point out in VI. 1.5 that the usual higher derivative theories are not necessarily being 
excluded by BFO. These last two subsections include discussion of their HKT counterparts. 

In VI. 1.6, I formalize the second point above by showing that I could just as well use 
lapse-uneliminated but lapse eliminable actions for GR and conformal gravity. For GR, 
these actions may be studied within Kuchaf 's split spacetime framework (SSF) [2291 123U1 
I23H I232j . This framework brings attention to tilt and derivative coupling complications in 
general (VI. 2.1), which are however absent for the minimally-coupled scalar, and 'acciden- 
tally absent' for the Maxwell and Yang-Mills 1-forms, which are what the TSA picks out. 
But tilt is present for the massive analogues of these 1-forms. I deduce the relation between 
tilt and the existence of a generalized BSW form. In VI. 2. 2 I counter BFO's hope that just 
the known fundamental matter fields are being picked out by the TSA, by showing that the 
massless 2-form is also compatible. In VI. 2. 3, I find alternative reasons why the Maxwell 
1-form is singled out by BFO, from the point of view of the hypersurface framework. I end 
by explaining the complications that would follow were one to permit derivative-coupled 
1-forms. 

In VI. 3, I point out that it is consistent to take the bosonic sector of nature to be far 
simpler than 4-d generally covariant spacetime structure might have us believe. Out of 
the tilt, derivative coupling and best-matching kinematics of spacetime, the entirely spatial 
best-matching kinematics suffices to write down actions for GR coupled to a full enough set 
of bosonic fields to describe nature. 

In VI. 4.1, I show how all these results also hold true upon inclusion of spin-^ fermions. 
VI. 4. 2 lists further research topics for fermions in the light of the advances made. 

1 Problems with the use of BSW actions 
1.1 Insights from mechanics 

Barbour uses the analogy between the BSW formulation of GR (|226j) and the Jacobi for- 
mulation of classical mechanics (in 1.1.2.2). I consider below limitations both on the Jacobi 
formulation, and limitations on the applicability of this analogy in the next subsection. 



The Jacobi formulation has a catch: the conformal factor is not allowed to have zeros. 
If it does then the conformal transformation is only valid in regions where there are no 
such zeros. These zeros are physical barriers in mechanics. For they correspond to zero 
kinetic energy by the conservation of energy equation. As the configuration space metric is 
positive-definite, this means that the velocities must be zero there, so the zeros cannot be 
traversed. 

The working for the Lagrangian homogeneously quadratic in its velocities (|47|) leads to 
a configuration space geometry that is conformal to a Riemannian geometry, which may be 
useful since Riemannian geometry is well-understood. Let L(gc, Qa) be instead a completely 
general function. Then*^"^ 

I = L (^qc, t'dX ^ Kqt,q'f,)dX (613) 



rA2 

may be modified to 'j ~ / ^(^Cj^a)^-^ 

(614) 

by Routhian reduction, where R = F, some homogeneous linear function of the q'j^ |246j . 
For example, F could be a Finslerian metric function from which we could obtain a Fins- 
lerian metric /^g = ^ dq^dq' ' provided that F obeys further conditions [201 including the 
nondegeneracy of /ab- So in general the 'geometrization problem' of reducing the motion 
of a mechanical system to a problem of finding geodesies involves more than the study of 
Riemannian geometry. 

To some extent, there is conventional freedom in the choice of configuration space ge- 
ometry. This is because standard manoeuvres can alter whether it is Riemannian. This is 
because one is free in how many redundant configuration variables to include, and in the 
character of those variables (for example whether they all obey second-order ELE's). 

As a first example of this, consider the outcome of the Routhian reduction of (|47|) for 
without loss of generality g„ cyclic, 

R(gA, ^a) = 2 - Mr^n ) 'i^'iB + -^QA " V , (615) 

where V is a modified potential. So removing redundancy in a Riemann-geometrizable 
action by Routhian reduction can lead to a non-Riemann-geometrizable system, on account 
of the penultimate 'gyroscopic term' j246j . which is linear in the velocities. I consider the 
reverse of this procedure as a possible means of arriving at Riemannian geometry to describe 
systems with linear and quadratic terms. I observe that if the linear coefficients depend on 
configuration variables, then in general the quadratic structure becomes contaminated with 
these variables. 

As a second example, higher-than-quadratic systems may be put into quadratic form by 
Ostrogradsky reduction |37Sj . at the price of introducing extra configuration variables. 

1.2 Criticism of BSW Jacobi analogy 

As explained in 1.2.5, GR is an already-parameterized theory. This is made manifest by 
writing it in its BSW form ()226|) . Although this looks similar to the Jacobi action in 
mechanics, there are important differences. Recollect from 1.2.7.1 that the GR configuration 
space is infinite-d; with redundancies, one can consider it to be Superspace. The DeWitt 
supermetric is defined on Superspace pointwise. By use of DeWitt's 2-index to 1-index map 

^''Here, Newtonian time is t whilst A is a parameter. Dot is used for ^ in mechanics workings and dash 
for A-indices take values 1 to n, A indices take values 1 to n and t, and A indices take values 1 to n — 1. 



(|218|) . it may be represented by a 6 x 6 matrix, which is ( — h + + ++) and thus indefinite. 
The special case Minisuperspace (1.3.3) has a ( — minisuper metric, thus also indefinite. 
In contrast mechanics has a positive-definite kinetic term. 

Also, the BSW action has the 'bad' ordering, whereas the Jacobi action has the 'good' 
ordering. Below, I first consider Minisuperspace, for which this extra complication does not 
arise, since by homogeneity the 'good' Jacobi and 'bad' BSW orderings are equivalent. 

Finally, BSW's work led to the thin sandwich conjecture (1.2.9.1) the solubility of which 
features as a caveat in BFO's original paper. Being able to pose this conjecture for a 
theory amounts to being able to algebraically eliminate the lapse from its Lagrangian. This 
implies that the theory is timeless in Barbour's sense |3U1 134j . Note that the extension of 
the conjecture to include fundamental matter fields has only recently begun jl57j . Such 
investigations are required to assess the robustness of the conjecture to different theoretical 
settings, to see if in any circumstances it becomes advantageous to base numerical relativity 
calculations on the thin sandwich algorithm. 

1.3 Lack of validity of the BSW form 

1.3.1 Conformal Problem of zeros 

In perfect analogy with mechanics, vacuum GR has a conformally-related line element 
ds^ = (A + aR)ds'^, for which the motion associated with (|226() is geodesic |lU8j . But the 
observation in mechanics that such conformal transformations are only valid in regions where 
the conformal factor is nonzero still holds for GR. It is true that the details are different, 
due to the indefiniteness of the GR supermetric. This causes the zeros to be spurious rather 
than physical barriers j272j . For whilst a zero z of the potential corresponds to a zero of 
the kinetic term by virtue of the Hamiltonian constraint, this now means that the velocity 
need be null, not necessarily zero, because of the indefiniteness. Thus, rather than grinding 
to a halt, the motion may continue through z 'on the Superspace null cone', which is made 
up of perfectly reasonable Kasner universes. Nevertheless, the conformal transformation 
used to obtain geodesic motion is not valid, so it is questionable whether the BSW form 
is a 'geodesic principle', if in general it describes conformally untransformed non-geodesic 
curves for practical purposes. 

To illustrate that the presence of zeros in the potential term is an important occurrence 
in GR, note that Bianchi IX spacetimes have an infinity of such zeros as one approaches the 
cosmological singularity. And these spacetimes are important because of the BKL conjecture 
(see 1.3.3). The above argument was originally put forward by Burd and Tavakol to 
argue against the validity of the use of the 'Jacobi principle' to characterize chaos in GR 
|347j . Our point is that this argument holds against any use, BFO's included, of the BSW 
form in Minisuperspace models of the early universe in GR. 

The way out of this argument that I suggest is to abstain from the self-infliction of 
spurious zeros by not performing the conformal transformation in the first place, thus 
abandoning the interpretation of the BSW form as a geodesic principle in GR. Conformal 
gravity however is distinct from GR and has no cosmological singularity, so arguments based 
on the BKL conjecture are not applicable there. Conformal gravity's zeros are real as in 
mechanics, because is positive-definite, and Barbour and O Murchadha use this to argue 
that topologies with i? < at any point are not allowed |40j. 

1.3.2 The BSW form is an unknown notion of distance 

BFO called the local square root ordering 'bad' because it gives one constraint per space 
point, which would usually render a theory trivial by overconstraining due to the ensuing 
cascade of secondary constraints. Yet GR contrives to survive this because of its hidden 
foliation invariance jHH] • However Giulini |157j has pointed out another reason why the local 



square root ordering is bad: it does not give rise to known geometry. Below, we extend his 
finite-d counterexample to the geometry being Finslerian. 

The BSW form as a notion of distance provides as the 'full metric' on Superspace 
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(616) 

where v"^ = h"^ = h""^ by DeWitt's 2-index to 1-index map (|218|) and where hats denote 
unit 'vectors'. So in general, G^^G^j^j^ = is a sufficient condition for the full metric 

to be degenerate and hence not Finsler (Giulini's particular example had a 1-d so this 
always occurred). But if G^j^G^j^ ^ 0, the full metric is not a function (it contains delta 
functions and integrals). So using the BSW form as a notion of distance leads to unknown 
geometry, so there is no scope for the practical application of the BSW form as a geodesic 
principle. 

This is to be contrasted with the global square root, for which the above procedure gives 
instead (semi)Riemannian geometry. For Minisuperspace, the local square root working 
presented does indeed collapse to coincide with this global square root working, and the 
resulting (semi)Riemannian geometry is of considerable use in Minisuperspace quantum 
cosmology j272j . Whereas conformal gravity is still plagued by the notion of distance 
difficulty, the GR issue of the sheaves (1.2.8.1) is resolved by privileged foliation. I should 
emphasize that I do not know of any reason why degeneracy due to stratification and 
degeneracy due to sheaves are at all related. What I am sure about is that the badness of 
the local square root geometry has nothing to do with this since this badness is already there 
for field theories with simple configuration spaces. I conclude that it is not appropriate to 
use the BSW form as a geodesic principle on Superspace. 



1.4 The fermionic contribution to the action is Hnear 



Since the kinetic terms of the bosons of nature are also quadratic in their velocities, I can 
use the modifications 



jg — , js + T^ , A + aR — >A + aR + [J^ 
to accommodate bosonic fields B/\ in a BSW-type action, 

-B 



\^ = j dxj d^xVhVA + (tR + Vts + T^ 



(617) 
(618) 



(619) 



This local square root encodes the correct Hamiltonian constraint for the gravity-boson 
system. 

Although the pointwise Riemannian kinetic metric is larger than the DeWitt super- 
metric, in the case of minimally-coupled matter it contains the DeWitt supermetric as an 
isolated block: 

GAsihab) 
Hf^^'-^iKb) 
If this is the case, it makes sense to study the pure gravity part by itself, which is a prominent 
feature of almost all the examples hitherto studied in the TSA. I identify this as a tacit 
simplicity requirement, for without it the matter d.o.f's interfere with the gravitational 
ones, so it makes no sense then to study gravity first and then 'add on' matter In Brans- 
Dicke (BD) theory, this is not immediately the case: this is an example in which there 
are gravity-boson kinetic cross-terms Gabx [s^e ()c{69() ] in the pointwise Riemannian kinetic 
metric unlike in (|619j) . The TSA form is then 
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Thus, the metric and dilatonic fields form together a theory of gravity with 3 d.o.f 's. Com- 
pared to the more disturbing examples in VI. 2. 4, this is a mild example of derivative cou- 
pling.®^ 

I now begin to consider whether and how the 3-space formulation can accommodate 
spin-i fermionic fields, F^. Following the strategy employed above for bosons, the BSW 
working becomes 

f = j d\j ^?xVhNL{habXb■,^^■,N■FJ^,F^) 



dA / d?x\/h 



(621) 



because is linear in F/\.^^ Then the usual trick for eliminating N does not touch T^, 
which is left outside the square root: 

f = j dxj d^xVh (^^A + aR + (JfVJ^ + T^) . (622) 

The local square root constraint encodes the correct gravity-fermion Hamiltonian constraint 

^n = -Vh{A + aR + \JF) + ^(^pop-^p^^ = 0. (623) 

I postpone the issue of the precise form of the best matching to be used until VI.4.1 My 
concern in this section is the complication of the configuration space geometry due to the 
inclusion of fermions. 

For now the elimination procedure is analogous not to the Jacobi working but rather to 
its generalization ()614() . So even the pointwise geometry of the gravity-fermion configuration 

space is now compromised: \/ A + aR + U^VT^+T^ could sometimes be a Finslerian metric 
function. By allowing (|621() . one is opening the door to all sorts of complicated possible 
actions, such as 

1) VG^^-'^^^Si-'ZE,. 

2) Arbitrarily complicated compositions of such roots, powers and sums. 

3) More generally, Kqq^, where Kq is allowed to be an arbitrary function of not only the 
QQ but also of the Q - 1 independent ratios of the velocities. 

4) The above examples could all be Finslerian or fail to be so by being degenerate. They 
could also fail to be Finslerian if the Kq are permitted to be functionals of overall degree 
in the velocities, which we can take to be a growth of the local-global square root ambiguity. 

Here are some examples of these, 
i) Inclusion of particles: for 1 particle. 



1(1)= / d^x. 



(^Xj, x^^^) ^ygABX^i^= I d\ I d^xVhN5^^^ (^Xj, x^^^) ytS^S^ 



dA 



Y^/iafea;(i)«x(i)'' - t2 (624) 



Furthermore, if one admits the conformal redefinition of the metric and scalar d.o.f 's, blockwise isolation 
of the form I619II is permitted, thus freeing the theory from derivative coupling. 

show in VI. 2 that the algebraic dependence on A*' emergent from such decompositions requires rigorous 
justification. I provide this for 162111 in VI.4.1 



by use of proper time. We could have (i) = (1) to (n) of these, in which case want one root 



per particle (bad ordering) 2, ... n) = / dA y habX^^^"" 



W={i) 

(625) 



to recover separate constraints p^-^^ — E'^^-^ = — -E'o(j) 

(626) 

for each particle from each square root in the sum. In a RWR context, one would Diff-BM 
the particle velocities (which act on everything) 



|(n)Difr= / dA xVh (VkT^^ + Vh ^ yj habQ^xWii^xi^^' - ii^t^ 



(i)=W 



(627) 



Then one has a momentum constraint with a standard vector and scalar contribution to it. 

ii) If without loss of generality the first m {< n) particles are to be charged, the linear piece 

/ dA ^ i±1j)Aa - iA^) (628) 

a)=(i) 

is to be added. Note that the particles are not to be U(1)-BM; the well-known correction 
for charged particles is to the momenta and not the velocities, and this arises just so for 
the above form. 

iii) The phenomenological matter contribution occurs linearly outside the BSW square 
root [c.f (|228|) ]. Note however that if I study this system in more detail (i.e in terms of 
fundamental matter), then I find that this term really belongs inside the square root. For 
if one admits 

that p o h-N{n+2Vhp)-C{ni-Vhji) 

(629) 

should really have a contribution from matter momenta, and uses (|182j) for fundamental p 
and (done here for the scalar field example), then one has 

p o h + 7rci-N(n + ^[^^ + + _ ^^{ri^ + 5^c^d^<;) (630) 



which wraps up to form yha\R+\J'"-\ -^-^ — 



rather than Vha — 2p+ j"^^ 



(631) 



(632) 



so that its RI form is v AB rather than v AB + C . 

iv) I will eventually admit some ugly 1-form theories to the TSA via horrendous combina- 
tions of type 2) in VII. 1.2. 

We would therefore need to modify the RI BSW implementation of principle R2 to a 
general BSW implementation capable of including fermions. This amounts to dropping the 
requirement of the matter field kinetic term being homogeneously quadratic in its velocities, 
thus bringing the TSA matter simplicity 5 into alignment with Teitelboim's assumptions. 
We note that with increasing generality the possibility of uniqueness proofs becomes more 



remote. Although some aims of the TSA such as a full derivation of the universal null 
cone would require some level of uniqueness proofs for spin-^ fermions, my strategy in this 
chapter is to show that spin-^ fermions coupled to GR do possess a 3-space formulation 
and also to point out that the uniqueness results may have to be generalized in view of the 
generalization of the BSW form required in this section. 

Could one not choose to geometrize the gravity-fermion system as a Riemannian geom- 
etry instead, by use of the reverse of Routhian reduction? But the coefficients of the linear 
fermionic velocities in the Einstein-Dirac system contain fermionic variables, so the result- 
ing Riemannian geometry's coefficients would contain the fermionic variables in addition to 
the metric. This is a breach of the DeWitt structure, since it means that contact is lost with 
DeWitt's study of the configuration space of pure GR |1U61 llU8j . So this choice also looks 
highly undesirable. 

For 40 years the natural accommodation of spin-^ fermions in geometrodynamics has 
been problematic (see 1.2.7.2, II. 1). So this is a big demand on the TSA, and one which must 
be met if the TSA is truly to describe nature. My demands here are less than Wheeler's 
in |374j : I am after a route to relativity with all matter 'added on' rather than a complete 
unified theory. The HKT route appears also to be incomplete at this stage: Teitelboim 
was unable to find a hypersurface deformation explanation for spin-^ fermions |351j . Thus 
when this work was started, all forms of the seventh route to relativity were incomplete 
with respect to the inclusion of spin-^ fermions. In VI.4.1, I point out the natural existence 
of GR-spin-^ theory within the TSA. 

1.5 Higher derivative theories 

I now argue against the significance of the preclusion of higher derivative theories by BFO. 
For the precluded theories are easily seen not to be the usual higher derivative theories. 
There are two simple ways of noticing this. First, the primary constraints encoded by the 
BFO theories with arbitrary \/{hij, hij^f^, ...) will always be of the form 

^/h'H = -Vhy + -^(^pop-^p'^^= 0, (633) 

while the usual higher derivative theories give something more complicated. Second, BFO's 
theories have fourth-order terms in their potentials but their kinetic terms remain quadratic 
in the velocities, whilst the usual higher derivative theories' kinetic terms are quartic in the 
velocities. This mismatch of derivatives between T and V for V ^ aR + A, overrules the 
theories from within the generally covariant spacetime framework, so BFO are doing nothing 
more than general covariance can do in this case. On the other hand, the potentials BFO 
consider are not absurd given their ontology; such potentials were specifically inquired about 
by Wheeler in |376j . 

It is not clear whether the usual higher derivative theories could be written in some 
generalized BSW form. The form would either be considerably more complicated than that 
of pure GR or not exist at all. Which of these is actually true should be checked case by 
case. I consider this to be a worthy project in its own right by the final comment in VI. 1.2, 
since this problem may be phrased as 'for which higher derivative theories can the thin 
sandwich formulation be posed?' To illustrate why there is the possibility of nonexistence, 
consider the simplest example, R + xR? theory. The full doubly-contracted Gauss equation 

is R = R- a{K o K - K'^) + 2aDa{n^Di,n'' - n^D^n^) 

(634) 

and, whereas one may discard the divergence term in the 3 + 1 split of R, in the 3 + 1 split of 
R? , this divergence is multiplied by R and so cannot similarly be discarded. So it is unlikely 
that the elimination of N will be algebraic in such theories, which is a requirement for the 



BSW procedure. Were this algebraic elimination possible, we would get more complicated 
expressions than the local square root form from it. Indeed, higher derivative theories are 
known to have considerably more complicated canonical formulations than GR [3421 167j ; it 
is standard to treat them by a variant of Ostrogradsky reduction adapted to constrained 
systems p7| . 

It is worth commenting that MKT's derivation of 7i being quadratic in its momenta and 
containing at most second derivatives may also be interpreted as tainted, since it comes 
about by restricting the gravity to have two d.o.f 's, as opposed to e.g. the three of -R + xE? 
theory or of Brans-Dicke theory. Thus I do not foresee that any variant of the seventh route 
to relativity will be able to find a way round the second-order derivative assumption of the 
other routes. 



1.6 Lapse-uneliminated actions 



I have explained why the interpretation of the BSW form as a geodesic principle is subject 
to considerable complications, and that it may obscure which theories are permitted or 
forbidden in the TSA. I next show that the use of the manifestly lapse-eliminated BSW form, 
and consequently the Problems with its interpretation, may be circumvented by the use 
of lapse-uneliminated but lapse eliminahle actions (just as the Jacobi and Euler-Lagrange 
interpretations of mechanics are equivalent). It is easy to show that the equations of motion 
that follow from the lapse-uneliminated 3-1-1 'ADM' Lagrangian H224|) are weakly equivalent 
to the BSW ones: 

= VhN[h'\aR+AyaR'^ - -^{p'^^pj - —p'^p) 



1 / X 

Vh{aR + A)--j=i^pop- — p2 



^ ^ ' h'm , (635) 



/ BSW ^ 

and similarly when matter terms are included. I use arbitrary a and W above to simulta- 
neously treat the GR and strong gravity cases. The ADM propagation of the Hamiltonian 



constraint is slightly simpler than the BSW one. 



n = -D\N'ni)+£^n 



(636) 

for IF = 1 or (7 = 0, where it is understood that the evolution is carried out by the ADM 
ELE's or their strong gravity analogues. 

I next check that using uneliminated actions does not damage the conformal branch of 
the TSA. The conformal gravity action ()334|) is equivalent to 



I = 

where the lapse is N 

' dp"^' 



dA- 



8D2 



H 2 — 

VTi{4>) 



(637) 



N- uneliminated 



Qpi-j 



dX 



The following equivalent of (|635|) holds: 



N-eliminated 



3V 



c 



(638) 



for ^C^_Vhct^ 



2 



<j[R ) + , 



(639) 

the conformal gravity equivalent of the Hamiltonian constraint. 

In the next section, I develop a strategy involving the study of lapse-uneliminated ac- 
tions. This represents a first step in disentangling Barbour's no time j3Ul 1311 134j and no 
scale |35( II Ij ideas. It also permits me to investigate which standard theories exist accord- 
ing to the other TSA rules, by inspection of formalisms of these theories. Where possible, 
one could then eliminate the lapse to cast theories into TSA form. This is not always 
possible: some perfectly good theories appear to have no actions which are simultaneously 
best-matched and BSW reformulable (in having an algebraically-eliminable lapse). Thus 
the uneliminated form can be used to help test whether the TSA is or can be made to be 
a satisfactory scheme for all of nature. 

I furthermore next use this lapse-uneliminated formulation to interpret the GR branch 
of the TSA within Kuchai^'s hypersurface or split spacetime framework (SSF), which has 
striking interpretational consequences. 

2 The 3-space approach and the spht spacetime framework 
2.1 Kuchaf's hypersurface or split spacetime framework 



In his series of four papers, Kuchaf considers the deformation of a hypersurface |229j . the 
kinematics of tensor fields on the hypersurface |23Uj . the dynamics of the fields split with 
respect to the the hypersurface |232j . and the geometrodynamics of the fields |232j . The 
fields are decomposed into perpendicular and tangential parts, as in 1.2.1 (N.B e = —1 
in this section). I am mainly concerned with 1-forms, for which the decomposition is 
Aa = n-AA± + e\Aa. A deformation at a point p of a hypersurface S may be decomposed 
as in (fig 12). 





b) 

Fi glirc 12: a)the translation part is such that <y.{p) 7^ 0, [daO^\{p) = 0. b) tilt part is such that = 0, 

I follow Kuchaf's use of first-order actions. For the 1-form, this amounts to rewriting 
the second-order action 1. = / d^x v1^L(^,, V^Ac, Sue) 

by setting A"^^ = 7-— — - — r and using the Legendre transformation 
d{yBAA) 

(A/i, V^Ac, L) — > (j4^, Acs, L), where the Lagrangian potential is 

L = [X'^^V bAa — \-]{Aa,Xbc:9de)- Then the 'hypersurface Lagrangian' is 

6f3\^ = J d^x{TT^5f^A^ + TT^5^Aa-a - (3^aK) (640) 
where the j4j-contribution to the momentum constraint a'^^q obtained from 

^ = S^ = 6^-£f, (641) 



(see fig 13) integrating by parts where necessary, and the ^j-contribution to the Hamiltonian 
constraint on a fixed background pj-i° may be further decomposed into its translation and 
tilt parts,®^ 

= aK + A^/ . (642) 

The translational part a"^" may contain a term 2xP°'^Kab due to the possibility of derivative 
coupling of the metric to the 1-form, whilst the remainder of a"^? is denoted by ATLt- 

aK = A-Ht + 2AP''''Kab . (643) 




Fi gure 13: The change along an arbitrary deformation of the hypersurface E is split according to 5^ = 
Sa + Sp. Kuchai^ shows that = 5p when acting on spatial tensors in Sec 7 of |229| and Sec 3 of |281| 

2.2 SSF as a TSA tool 

I win show that BFO original TSA is complete for the bosons of nature, and more. 
For the 1-form field, using the decomposition 

A^^ = X^^n^n^ + X'^e^n^ + A^^n^ef + A'^^e^ef (644) 

and A**"*" = vr" , A"*""*" = vr-*- (by the definition of canonical momentum) , one obtains 

AHt = L + VhiX^'^DaA^ - X'^'DaAk) . (645) 



One also requires a^""* = ^(-A^'^l A^l'') + ^^A^"'') - ^("vr'')) . 

(646) 

For the 1-form, A"*"" and A"^ play the role of Lagrange multipliers; one would then use the 
corresponding multiplier equations to attempt to eliminate the multipliers from (|640() . In 
my examples below, A±_ will also occur as a multiplier, but this is generally not the case. 

The above sort of decomposition holds for any rank of tensor field. W^, P"'^ and £p are 
universal for each rank, whereas Tit contains L, which has further details of the particular 
field in question. These three universal features are the kinematics due to the presupposition 
of spacetime. The £p contribution is 'shift kinematics', whilst the tilt contribution is 'lapse 
kinematics'. 

The point of Kuchaf 's papers is to construct very general consistent matter theories by 
presupposing spacetime and correctly implementing the resulting kinematics. I show below 
that in not presupposing spacetime, BFO are attempting to construct consistent theories 
by using shift kinematics (Diff-BM) alone, and thus attempting to deny the presence of any 
'lapse kinematics' in nature. This turns out to be remarkably successful for the bosonic 
theories of nature. 

*^The subscript / denotes the tilt part and the subscript t denotes the translational part. 



I begin by noting that nonderivative-coupled fields are a lot simpler to deal with than 
derivative-coupled ones. I then ask which fields are included in this simpler case, in which 
the matter fields do not affect the gravitational part of the Hamiltonian constraint so that 
the gravitational momenta remain independent of the matter fields. Thus it became clear 
to me that this is a tacit assumption in almost all of BFO's work. 

TSA Gravity— Matter Simplicity : the implementation of 'adding on' matter is for 
matter contributions that do not interfere with the structure of the gravitational theory. 
This amounts to the absence of Christoffel symbols in the matter Lagrangians, which is true 
of minimally-coupled scalar fields (I?a? = f^a?) and of Maxwell and Yang-Mills theories and 
their massive counterparts (since DaA^ — DhAa = daAi, — di,Aa). Thus it suffices to start off 
by considering the nonderivative-coupled case on the grounds that it includes all the fields 
hitherto thought to fit in with the BFO scheme, and also the massive 1-form fields. 
Consider then the Proca 1-form. Its Lagrangian is 

L^roca = -V[^^B]V[^A^1 - ^AaA^ , (647) 



with corresponding Lagrangian potential L = — -X'^^^^ X[ab]~^ — ^AaA"^ . 

(648) 



Whereas a'H°i has in fact been completed to a divergence, ,.J~L"/ = A°'DaTT'^+A±DaTT°' 

T (A) / 

(649) 

suffices to generate the tilt change of A^ and Aa for the universal 1-form (see Sec 6 of |231j ) . 
The first term of this vanishes since vr-*- = by antisymmetry for the 1-forms described by 
(|647|) . Also A-P'*'' = by antisymmetry so 



'i^""^"' + ^vr.vr- + "^{AaA- - A\) - X^'' A\ 



+ A^DaTT" . (650) 



by ()6421 164:]|) . The multiplier equation for Xab gives Xab = —'^D\pAa^ = Bah ■ 

For m 7^ 0, the multiplier equation for Ax gives A± = -=Da'T^°' 

rn^Vh 

and elimination of the multipliers in (|65U|) using 1)6511 1652|l gives 



(651) 



(652) 



which is non-ultralocal in the momenta. I note that this does nothing to eliminate the 
remaining term in the tilt: the Proca field has nonzero tilt. 

But, for m = 0, the A± multiplier equation gives instead the Gauss constraint of 

electromagnetism Q = Dair"' ~ . 

(654) 

This would not usually permit A±_ to be eliminated from ()653p but the final form of jiJ-L° 



for m = is aW° = ^B'^'^Bab + ^TTaTT'^+A^, {DaTT^ ^ 0) , 



(655) 



so the cofactor of A± in H64U|) weakly vanishes by (|654|) . so A± may be taken to 'accidentally' 
drop out. This means that the tilt of the Maxwell field may be taken to be zero. The tilt is 
also zero for the metric and for the scalar field. So far all these fields are allowed by BFO 
and have no tilt, whereas the so-far-disallowed Proca field has tilt. 

I can begin to relate this occurrence to the BSW or generalized BSW implementation 
of R2 . For, suppose an action has a piece depending on daCt in it. Then the immediate 
elimination of a from it is not algebraic, so the procedure of BSW is not possible. By 
definition, the tilt part of the Hamiltonian constraint is built from the daO: contribution 
using integration by parts. But, for the ^_|_ -eliminated Proca Lagrangian, this integration 
by parts gives a term that is non-ultralocal in the momenta, {Dair"')'^, which again contains 
daO: within. Thus, for this formulation of Proca theory, one cannot build a TSA Einstein- 
Proca action to start off with. Of importance, this difficulty with spatial derivatives was 
not foreseen in the simple analogy with the Jacobi principle in mechanics, where there is 
only one independent variable. 

The above argument requires refinement from the treatment of further important phys- 
ical examples. This is a fast method of finding matter theories compatible with the TSA 
by the following argument. If there is no derivative coupling and if one can arrange for the 
tilt to play no part in a formulation of a matter theory, then all that is left of the hyper- 
surface kinematics is the shift kinematics, which is BM [general Rl]. But complying with 
hypersurface kinematics is a guarantee for consistency for established spacetime theories so 
in these cases BM suffices for consistency. 

First, I consider K interacting 1-forms with Lagrangian^^ . 

(656) 

I define = — — - — ■—- and the corresponding Lagrangian potential is 

o[VbA'^) 

r_ ^\[AB].M Sep aD \ABB , ''^'^ A /l^M (RK7\ 

^-~^^M -^[AB] - y/BDE^B^A-^ + —AbmA . (657) 

The overall tilt contribution is now the sum of the tilt contributions of the individual 
fields, so (Am)^/ ~ A^DaTr^ suffices to generate the tilt change. Again, Am^"^^ = by 
antisymmetry so 



A, 



M° = Vh 



^Kh^m + ^■^a ■^M + ~^(^a " ^± ^±m) " \M.A[a,b] 



+A'^T>A - |/MPQ(^A"^^Ar^Q + 2^^AlA^) (658) 
by (|6421 The multipliers are and A^, with corresponding multiplier equations 

A^ = -2D^,A'^ ^ , (659) 

^M± = ^D,7r£^ = ^{DaTT^M + gJl^MPTT^^A^) (660) 



m 



\/h m?\/h 



for m 7^ 0. I thus obtain the eliminated form 

(661) 



**By g/'^^BC ^ strictly mean g'^ f^^c where A indexes each gauge subgroup in a direct product. Then 
each such gauge subgroup can be associated with a distinct coupling constant g'^. 



and the massive Yang-Mills field is left with nonzero tilt. For m = 0, the second multiplier 



equation gives instead the Yang-Mills Gauss constraint = Davr^*^ ~ . 

(662) 

In this case, the tilt is nonzero, but the Yang-Mills Gauss constraint 'accidentally' enables 
the derivative part of the tilt to be converted into an algebraic expression, which then 
happens to cancel with part of the Lagrangian potential: 

AmW° = ^B^B^ + ^^^^M + AfiDaTT^^ + g/LMPVT^^^r ^ 0) . (663) 
Second, I consider U(l) 1-form-scalar gauge theory [c.f (llflSmiO]) ] 



Luaf = -V[A^B]V[^^^] + (a^? - ieAAx){d\* + ieA\*) - . (664) 

Now, in addition to A'^'^, I define ri'^ = —— — ^ and = 7-— 7 ,so the Lagrangian 

5(VaO 5(Va?*) 

potential is L = --X^^^^\[ab\ + ^^aA"^ + /x^i^a - ieAA{Cv^ - ?;U^) + . 

(665) 

(A)'^^° = A±Da'7T"' still suffices to generate the tilt (as scalars contribute no tilt). 



,,,.,A^'"'' = 0, and ,,,,,aK = ^ 



1 1/1 \ rn? 



-ie \^Aa['^*i^'^ - - --^[<^*7r^. - (^tt^' 

(666) 

The Xab multiplier equation is H651|) again, whilst the A± multiplier equation is now 

gu(i) = DaTT^ + ie(?*7r,. - ?^,) = , (667) 

which can be explained in terms of electromagnetism now having a fundamental source. In 
constructing f,,;*,A'^° from ()6421 164^ I666() . I can convert the tilt to an algebraic expression 
by the sourced Gauss law (|667j) which again happens to cancel with a contribution from 

the Lagrangian potential: ?,?*,aW° = -A"''yl[a,f,] - (M'' + z^'')?,a + (A)W/ +A,<;,f* K 

1 1/1 \ 



+A^[DaVr"+ie(?*7r,.-?7r,) ^ 0] . 

(668) 

It is not too hard to show that the last two 'accidents' also 'accidentally' conspire together 
to wipe out the tilt contribution in Yang-Mills 1-form-scalar gauge theory 1)1191 112U1 1129|) 
used to obtain broken SU(2) x U(l) bosons for the electroweak force. This theory is also 
obviously nonderivative-coupled. 



I now present a more general treatment about the occurrence of these 'accidents'. They 
arise from eliminating A± from its multiplier equation. For this to make sense, A± must 
be a multiplier, thus tt"*" = 0. Then for general L, the multiplier equation is 

dL 

^ + Dan" = . (669) 

Then the requirement that A±DaTr" + L be independent of A± on using H669|) means that 
—^±-£^ + L is independent of A±. Thus the 'accidents' occur whenever the Lagrangian 
potential is linear in A±. 

The above two examples have broadened my understanding, and lead to the following 
precise reformulation of the BSW-type version of RI[R2]) within the SSF as 
RI-castable[R2] : I use lapse-uneliminated actions homogeneously quadratic in their ve- 
locities and permit only those for which the matter contributes a weakly vanishing tilt. 
I can combine this with the generalization of dropping the homogeneously quadratic re- 
quirement, in anticipation of the inclusion of spin-^ fermions. 

So for Einstein-Maxwell theory, Einstein- Yang-Mills theory, and their corresponding 
scalar gauge theories, 1) the absence of derivative coupling guarantees that they can be 
coupled to GR without disrupting its canonical structure as tacitly assumed by BFO. 2) 
The absence of tilt guarantees that the resulting coupled theories can be put into BSW 
form. Because the theories have homogeneously quadratic kinetic terms, this is the most 
straightforward notion of BSW form. 3) Now, the SSF guarantees consistency if all the 
required kinematics are included. But the only sort of kinematics left is BM. Thus, all 
these theories are guaranteed to exist as TSA theories. 

These workings begin to show (if one presupposes spacetime), what sorts of obstacles 
in Kuchaf's split spacetime ontology might be regarded as responsible for the uniqueness 
results for bosonic matter when one starts from BFO's 3-d ontology. 



2.3 The TSA allows more than the fields of nature 



I have described how the fields hitherto known to be permitted by the TSA may be iden- 
tified within the SSF. These fields all have just BM kinematics and no other significant 
universal feature (tilt or derivative coupling). Are these fields then the known fundamental 
matter fields, which somehow have less universal kinematic features than generally covari- 
ant spacetime structure would lead one to expect? This question may be subdivided as 
follows. Does the TSA single out only the known fundamental matter fields? Does the TSA 
single out all the known fundamental matter fields? Kuchaf makes no big deal about the 
simplified form weakly equivalent to his decomposition of the electromagnetic field, because 
it does not close to reproduce the Dirac Algebra (see Sees 11-12 of j231j ): it only does so 
modulo Q. He takes this to be an inconvenience, one which can be got round by adhering 
to the form directly obtained from the decomposition, whereas BFO take it as a virtue that 
the simplified form 'points out' the new constraint, Q, as an integrability condition. 

The first question can be answered by counterexample. One should interpret the ques- 
tion as coarsely as possible; for example one could argue that the TSA is not capable of 
restricting the possibility of Yang-Mills theory to the gauge groups conventionally used to 
describe nature, or that by no means is massless 1-form-scalar gauge theory guaranteed to 
occur in nature. Rather than such subcases or effects due to interaction terms, I find it 
more satisfactory to construct a distinct matter theory which is not known to be present in 
nature. The last subsection has put me into a good position to do this. 

2 

Consider the 2-form ^ab Lagrangian L* = -V[c'^ab]'^^'^^^^^-^^'^ab'^^^ , 

(670) 



define \^^^ = -r-= — r and use the Legendre transformation to obtain the 

d{Vc^AB) 

Lagrangian potential L = -^X^^^'^^ \[abc] + -y^ ^^^^^ • 

(671) 

Then ($)'^^° = '^'^ ±bD a^^""^ suffices to generate the 2-form tilt and = by antisymme- 
try. The multipliers are A"'"^ and A^a with corresponding multiplier equations 

Xabc = -2L»rf,^>„M = Babe and, for m / 0, = ^^DaTi"^ , 

m^V h 

(672) 

which may be used to eliminate the multipliers, giving rise to the non-ultralocal form 



= ^B-^'^Babc + + -4^/i6d(^aVr"')(^cVr='^) + ^1>a6^"' . (673) 



But for m = 0, the $_Lfe multiplier constraint is Q = Dair' 



ab 



(674) 



and = l^B'''^Babc + -^7T''''TTab + 2^±b{Da7T''' « 0) . 

(675) 

So the massless 2-form's tilt is zero and this leads to the elimination of ^±b by the same sort 
of 'accident' that permits A± to be eliminated in electromagnetism. So, for this massless 2- 
form, BM is equivalent to all the spacetime hypersurface kinematics, and as this guarantees 
closure, we deduce that there exists a resulting TSA theory starting with 



I* 



JdxJ d^xVh^R + Z?[,ci>,5]Z)[-cI>"''l^Tg + h-bh-d{^[ab] - £^^abm[cd] - £^^cd) , 

(676) 

which leads to the enforcement of l|674l) . which is subsequently encoded by the introduction 
of an auxiliary variable Gf,. This working should also hold for any p-form for p < d, the 
number of spatial dimensions. Yet only the p = 1 case, electromagnetism, is known to occur. 
This is evidence against BFO's speculation that the TSA "hints at partial unification" of 
gravity and electromagnetism, since these extra unknown fields would also be included as 
naturally as the electromagnetic field. Note also that the ingredients of low-energy string 
theory are getting included rather than excluded: p-forms, the dilatonic coupling... These 
are signs that the TSA is not as restrictive as BFO originally hoped. 

The second question must be answered exhaustively. It is the minimal requirement 
for the TSA to be taken seriously as a description of nature. The TSA gives gravity, 
electromagnetism and Yang-Mills theories including those of the strong and electroweak 
nuclear forces. One may argue that disallowing fundamental Proca fields is unimportant 
- use spontaneous symmetry breaking - but the inability at this stage to treat the weak 
bosons directly is disturbing. The next project is the inclusion of spin-^ fermions (see VI. 4), 
in order to complete the TSA for the theories of the simplest free fundamental fields that 
can account for nature (though recollect that this is subject to the simplicity in 1.2.3.4). I 
then follow by investigating all the interactions involved in the Standard Model j294j . 

2.4 Derivative coupling and the 3-space 1-form ansatz 

In their study of 1-forms, BFO used a BSW-type action with potential term 

Ua = C'''"^''DhAaDdA, + — ^aA^ (677) 



which is natural within their 3-space ontology. They then obtain ^Ti and ^Tii in the usual 
3-space way (from the local square root and from ^*-variation) . Then the propagation of ^Ti 
enforces Ci = —C2 , C3 = and also Q, whose propagation then enforces M = 0. Having 
thus discovered that a new (Abelian) gauge symmetry is present, Q is then encoded by the 
corresponding U(1)-BM, by introduction of an auxiliary velocity Q [c.f H487|) ]. Identifying 
Q = Aq, this is a derivation of Einstein-Maxwell theory for A a = [AQ,Ai]. 

I find it profitable to also explain this occurrence starting from the 4-d ontology of the 
SSF. The natural choice of 1-form potential and kinetic terms 



would then arise from the decomposition of L"^ = -C^^'^'^V bAaV dAq —AaA-^ 



Using the following set of four results from Sec 2 of j23Uj . 



(678) 



(679) 



aV±Aa = -SgAa - aKabA^ - A±daa , 



VbAa = DbAa - A±Kab , 



(680) 
(681) 



I obtain that 
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A^daa 
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5pA^ + A'^daa 
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SnAa + A^daa \ ^ 
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KacA' 
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+ KacA' {D^A^ - K\A') 



5pA^ + A'dca 
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{D^Aa - A^K) 



(682) 



M2 



-C'^'^^DbAa - A^Kab){DdAc - A^K^) - —{A^A^ - A^A' 



(683) 



Then, if one adopts the spacetime ontology and then imports BFO's 3-space assumptions 
into it, one finds the following 'spacetime explanations' for BFO's uniqueness results. 

First, BFO's tacit assumption that addition of a 1-form Aa does not affect the 3- 



geometry part of the action can be phrased as there being no derivative coupling, a-P = 0, 
which using (IM implies that A^'"'') = 0, vr^ = -A-^^ Since A^^ = -2C^^^^V dAc, this 
by itseJf implies Ci = — C2, C3 = 0. 

In I wrote that Barbour would argue that is a velocity, since it occurs as a 
correction to velocities in the kinetic term [HS]^^ (following from its auxiliary status, just 
as N and ^ are velocities), it makes sense for the 3-space ansatz to contain no B^A-*-. But 
from (|683|) . this by itself is also equivalent to C\ = — C2, C3 = from the 4-d perspective. 
Also, inspecting (|683() for Maxwell theory reveals that 



-cm(3+l) 



Da{-aA^)Y - CiD'A^{DbAa - Da A, 



(684) 



*^When writing the thesis, I derived that this is essentiaUy correct, but care is needed for the reason given 
here: one could just as weU have products of a velocity with coordinates. 



So in fact B = —aA±, so A± itself is not a velocity. Notice in contrast that the issue of 
precisely what Q is does not arise in the TSA because it is merely an auxiliary velocity that 
appears in the last step of the working. 

One argument for the 3-space 1-form field ansatz is simplicity: consideration of a 3- 
geometry and a single 3-d 1-form leads to Maxwell's equations. However, I argue that in 
the lapse uneliminated form, provided that one is willing to accept the additional kine- 
matics, I can extend these d.o.f's to include a dynamical A±. The TSA is about not 
accepting kinematics other than BM, but the spacetime hypersurface framework enables 
one to explore what happens when tilt and derivative-coupling kinematics are 'switched 
on'. Working within the SSF, if A± is allowed to be dynamical, there is derivative coupling, 
and consistency would require the presence of 2 further bunches of terms, with coefficients 
proportional to Ci + C2 and to C3. The first bunch consists of the following sorts 

of terms: D^A'^A^df^hab , A' (^D''A^_ - ^pKb , h'^^A'^d^AaSf^hc , 

A^A'^h'^'^d^habSf^Kd , A^A^h'''h'"'6^habS^h,d . (685) 
The second bunch consists of the following sorts of terms: 

h''' (a^D^A, + A^^) 5gKb , -h''''A^6,A^6ghab , A^A^h^^'h'HMgKa ■ (686) 



The naive blockwise Riemannian structure of the configuration space of GR and 
nonderivative-coupled bosonic fields (|619() can get badly broken by derivative coupling (c.f 
Kuchaf IV. 5). Either of the above bunches by itself exhibits all the unpleasant configuration 
space features mentioned in VI. 1.4: the first two terms of (|(i85j) are linear and hence the 
geometry is not Riemannian, the third is a metric-matter cross-term, and the last two terms 
breach the DeWitt structure; likewise the first term of (|(i8(ij) is linear, the second is a cross- 
term and the third is a breach of the DeWitt structure. If the DeWitt structure is breached 
in nature, then the study of pure canonical gravity and of the isolated configuration space 
of pure gravity are undermined. Whereas there is no evidence for this occurrence, I have 
argued in 1.2.3.4 that some forms of derivative coupling are only manifest in experimentally- 
unexplored high-curvature regimes. 

In the SSF, if A± were dynamical, then it would not be a Lagrange multiplier, and 
so it would not have a corresponding multiplier equation with which the tilt could be 
'accidentally' removed, in which case there would not exist a corresponding BSW form 
containing A±. This argument however is not watertight, because it does not prevent some 
other BSW form from existing since variables other than A± could be used in attempts to 
write down actions that obey the 3-space principles. As an example of such an attempt, I 
could use the a-dependent change of variables to Aq to put Proca theory into BSW form. 
In this case I believed when I wrote ^ that the attempt fails as far as the TSA is concerned, 
because Aq features as a non-BM velocity in contradiction with the adopted implementation 
of principle general Rl.^'^ This shows however that criteria for whether a matter theory 
can be coupled to GR in the TSA are unfortunately rather dependent on the formalism 
used for the matter field. The TSA would then amount to attaching particular significance 
to formalisms meeting its description. This is similar in spirit to how those formalisms 
which close precisely as the Dirac Algebra are favoured in the hypersurface framework and 
the HKT and Teitelboim j351j articles. In both cases one is required to find at least one 
compatible formalism for all the known fundamental matter fields. 

3 Connection between space and spacetime viewpoints 

^°But also see VII. 1 for an amendment, and for further counterexamples of uniqueness once one allows 
form 2) on p 146. 



Presupposing 3-space (taken to mean 3-geometry, not the 3-metric), the universal kinematics 
is Diff-BM. The spHt (IH^ IMH]) of aW° or perhaps more simply the equations EHOl 
16811 1682]) (and their analogues for higher-rank tensors (see e.g Sec 9 of j231j ). sum up the 
position of Diff-BM within the SSF. In this latter case, there are three sorts of universal 
kinematics for tensor fields: BM, tilt and derivative coupling. All three of these are required 
in general in order to guarantee consistency (if the spacetime theory works) and Kuchaf's 
papers are a recipe for the computation of all the terms required for this consistency. Thus 
in GR where it is available, the SSF is powerful and advantageous as a means of writing 
down consistent matter theories. If conformal gravity is regarded as a competing theory to 
GR, it makes sense therefore to question what the 4- geometry of conformal gravity is, and 
whether its use could lead to a more illuminating understanding of matter coupling than 
offered by the TSA. I am thus free to ask how special GR is in admitting a constructive 
kinematic scheme for coupled consistent tensorial matter theories. 

As BFO formulate it, the TSA denies the primary existence of the lapse. But I have 
demonstrated that whether or not the lapse is eliminated does not affect the mathematics, 
so I prefer to think of the TSA as denying 'lapse kinematics'. BFO's use of BSW forms 
does lead to a more restrictive scheme than the split spacetime framework, but I have 
demonstrated in VI. 2 that this restriction can be understood in terms of when the split 
spacetime framework has no tilt. Furthermore, I have unearthed the tacit TSA gravity- 
matter simplicity and have rephrased this and the generalized BSW form of RI[R2] 
as nonderivative coupling and the no tilt condition respectively within the SSF. 

Working in the SSF with lapse-uneliminated actions with only shift (Diff-BM) kinemat- 
ics has the additional advantage that I can immediately turn on and hence investigate the 
mathematical and physical implications of the tilt and derivative-coupling kinematics. Nev- 
ertheless, it is striking that BM kinematics sufRce to describe all of the known fundamental 
hosonic fields coupled to GR. The absence of other kinematics includes the absence of the 
derivative-coupled theories whose presence in nature would undermine the study of pure 
canonical gravity of DeWitt and others. I see my above work as support for this study. 
The less structure is assumed in theoretical physics, the more room is left for predictability. 
Could it really be that nature has less kinematics than the GR SSF might have us believe? 

Following from the debate of BM versus bare 'discover and encode' methods in II. 2. 2 and 
III. 1.2, I have by now demonstrated the gauge-theoretic gauge constraints to be integrabil- 
ities of Ti.. This permits the following alternative to starting with the BM implementation 
of general Rl: 

Integrability [general Rl] : start with a 3-d action with bare velocities. W can be deduced 
immediately from the action, and demanding 7^ ~ leads to a number of other constraints. 
These are all then to be encoded by use of auxiliary variables. 

This has the immediate advantage of treating Diff-BM on the same footing as the encoding 
of Gauss constraints (G-BM). In VII. 1 I also consider achieving the same footing but rather 
by moving in the other direction. 

So far, at least the bosonic sector of nature appears to be much simpler than the GR 
SSF might suggest, and the GR branch of the TSA may be formulated in two hitherto 
equivalent ways (making use of either the BM or the Integrability implementation of 
general Rl). I next consider both of these for spin-^ fermions. 

4 Spin- 1 fermions and the TSA 

Whereas it is true that the spinorial laws of physics may be rewritten in terms of tensors 
j292j . the resulting equations are complicated and it is not clear if and how they may 
be obtained from action principles. Thus one is almost certainly compelled to investigate 
coupled spinorial and gravitational fields by attaching local flat frames to our manifolds. 
There are two features I require for the study of the spin-^ laws of nature coupled to 



gravity. First, I want the study to be clear in terms of shift and lapse kinematics, given my 
success in this chapter with this approach. However, one should expect the spinors to have 
further sorts of kinematics not present for tensor fields. Second, I want to explicitly build 
S0(3, 1) (spacetime) spinors out of S0(3) (spatial) ones.^^ 

I begin here by looking at the first feature in the following 4-component spinor formalism. 
Introduce Dirac's suited triads, denoted by E^, which obey Eq^ = 0, Eqq = —a, ^^^CB — 
r/^g and E^^E^ = gsc- The spacetime spinorial covariant derivative is 

V^^V = - l^RsAl^l^i^ , (687) 



where n^gp = {Vb^rJ^Pp 

(688) 

is the spacetime spin connection The spatial spinorial covariant derivative is 



(689) 
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'P 



where ojfsp = (-DfeEfa)Ef e! 

(690) 

is the spatial spin connection. 

Next split the spacetime spin connection, following Henneaux's treatment jl79j which 
has the virtue of keeping track of the geometrical significance of each of the pieces. I supply 
each piece with contracting gamma matrices as suits its later application. As 

^(PO)iJ = > (691) 

there are 4 components in its decomposition, ujpqf may be used as it is, to form the 3-d 
spinorial covariant derivative. By (|690|) . suited tetrad properties, the Dirac algebra 
and Kab = Kba, Ka± = (see 1.2.1.3), 

7^"75^0g-f = -K . (692) 



By (pini) . (EH) in (EOI) and use of suited tetrad properties, '-^poo 



By (jnini), w^jo = 9oE[^iE^g]-a(i:/3Ega)E"f . 

Then, using (IM . (l69T^ and (l69n|l . 7^V^^ = 7°V|?/; + 7Vf^ 
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(693) 



(694) 



7"V|V + y^fV' + i,ri'l'^^-,mi<l^ ■ (695) 



^^This is standard use of representation theory, based on the accidental Lie algebra relation 
SO(4)5z S0(3) S0(3), which depends on space being 3-d. This relation is a common source of tricks in 
particle physics and quantum gravity. By SO(3, 1) and SO(3) spinors, I strictly mean spinors corresponding 
to their universal covering groups, SL(2, C) and SU(2) respectively. I am not yet concerned in this thesis 
with the differences betwen S0(4) and SO(3, 1) from a quantization perspective, which render Euclidean 
quantum programs easier in some respects |354II29^?] . 



Then the first term may be replaced by oVqiP = ip—£^p'ip—djiip+-dfa'j^'y^ 

(696) 

by sphtting (|694j) into two pieces; the first of these is directly geometrically meaningful, 
whereas the second is geometrically meaningful when combined with 

dRip = -7^[f^s]iYY4' (triad rotation correction) , (697) 



and 



(Lie derivative) 



(698) 

Thus the tensorial Lie derivative -C/jV' = (^^di^p is but a piece of the spinorial Lie derivative 
(|698|) |149l I22U] . The second term in ()695(1 is already in clear-cut spatial form, while the 
last term is just by (IH^ . Thus 



a^S^Dfij + + «,z7%V - (V' - £f3i^ - dRt/j) 



(699) 



Next, although derivative coupling (second term) and tilt (third term) appear to be 
present in H699|) . Geheniau and Henneaux jl49j observed that these simply cancel out in 
the Dirac field contribution to the Lagrangian density. 
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-Vham^Tpip (700) 

[c.f the second form in H99|l ]. 

Whilst Nelson and Teitelboim j278j do not regard such a choice of absence of derivative- 
coupling as a deep simplification (they adhere to the HKT school of thought and the simpli- 
fication is not in line with the hypersurface deformation algebra), the Geheniau-Henneaux 
formulation is clearly encouraging for the TSA. For, using BSW procedure on the 3-1-1 
split Einstein-Dirac Lagrangian, one immediately obtains a generalized Rl-castable-type 
action: 



'ed 



dA / d^xVh 



(701) 



Thus one knows that 



A + aR + ?/;t707' rev - 7V(^fV'"^)'0 - m^7p7pJj^{S,^hij) 



(702) 



[a particular case of the form H622|) ]. will work as a spatial ontology starting-point for 
Einstein-Dirac theory. Note that in addition to the more complicated form of the Diff-BM 
correction to the Dirac velocities, there is also a triad rotation correction (|697|) . Thus either 
of the following arguments should be adopted. The BM [general Rl] implementation 
should be generalized to accommodate this additional, natural geometric correction: given 
two S0(3) spinor bundles Si, S2, the (full spinorial) drag shufflings of S2 (keeping Si fixed) 
are accompanied by the rotation shufflings of the triads glued to it. The triad rotation 
correction then leads to a further 'locally Lorentz' constraint ^T^p jlllj . Alternatively, it 
can be checked that this constraint will be picked up by the Integrability [general Rl] 
implementation, since it is an integrability condition. This check may be implied to be 
verified by Nelson and Teitelboim's work [278j : both Tii and jT^p are indeed integrability 
conditions for TL. For in terms of Dirac brackets { , }*, starting from TL, {Ti^'H}* gives 
Tii and then we can form {TL^'Hi}* which gives J^j^ (and H) so one has recovered all the 
constraints as integrability conditions for 7i. Also, one does not recover 7i if one starts with 
Tii or Jxbi s° some sense H. is privileged. However, this does highlight my other caveat 
for the integrability idea: one might choose to represent the constraint algebra differently 
by mixing up the usual generators. For example, a linearly-related set of constraints is 
considered in |278j , for which the integrability of any of the constraints forces the presence 
of all the others. My defense against this is to invoke again that we only require one 
formulation of the TSA to work, so we would begin with the quadratic constraint 7i nicely 
isolated. 

Also note I succeed in including the 1-form-fermion interaction terms of the Einstein- 
Standard Model theory: 

g-^TMi>l~^^jA\^ (703) 

where A takes the values U(l), SU(2) and SU(3). The decomposition of these into spatial 
quantities is trivial. No additional complications arise from the inclusion of such terms, 
since 1) they contain no velocities so the definitions of the momenta are unaffected (this 
includes there being no scope for derivative coupling) 2) they are part of gauge-invariant 
combinations, unlike the Proca term which breaks gauge invariance and significantly alters 
the Maxwell canonical theory. In particular, the new terms clearly contribute linearly in 
A_\_ to the Lagrangian potential, so by the argument at the end of VI.2.2, an 'accident' 
occurs ensuring that tilt kinematics is not necessary. Also, clearly the use of the form 
H70U|) is compatible with the inclusion of the interactions (|7U3() since, acting on ■0 the gauge 
correction is the opposite sign. So our proposed formulation's combined Standard Model 
matter Lagrangian is^'^ 



-SM= 2 



1 - A 



(^^^(V^ - g^rf E^A^)V - (V^ + gA^M^A)^^^ - m^^i^ + lyu • (704) 



which can be adjoined to L3_|_i(gr) , and successfully subjected to the BSW procedure, thus 
announcing that an enlarged version of ()702() will serve as a spatial ontology starting-point. 

There is also no trouble with the incorporation of the Yukawa interaction term <;'^l>^l; 
which could be required for some fermions to pick up mass from a Higgs scalar. Thus 
the Lagrangian for all the known fundamental matter fields can be built by assuming BM 
kinematics and that the DeWitt structure is at the outset respected. The thin sandwich 
conjecture can be posed for all these fields coupled to GR. The classical physics of all these 
fields is timeless in Barbour's sense. 

It remains to cast the action (|7()2|) in entirely spatial terms, as it still has remnants 
of spacetime in its appearance: it is in terms of 4-component spinors and Dirac matrices. 
However, recall that the Dirac matrices are built out of the Pauli matrices associated with 
SO (3), and choosing to work in the chiral representation H95() . the 4-component spinors 

^^Here Lym is given by the m — 0, SU(3) x SU(2) x U(l) version of l|656|l and one would need to sum 
the square bracket over all the known fundamental fermionic species. 



may be treated as D and L S0(3) 2-component spinors H94|) . Thus a natural formulation 
of Einstein-Dirac theory in terms of 3-d objects exists. To accommodate neutrino (Weyl) 
fields, one would consider a single ipj^ S0(3) spinor, i.e set its pair Vd and its mass to 
zero before the variation is carried out. Whilst we are free to accommodate all the known 
fundamental fermionic fields in the TSA, one cannot predict the number of Dirac and Weyl 
fields present in nature nor their masses nor the nongravitational forces felt by each field. 

Suggested Further Work 

Here are some suggestions for further work given the above progress. Consider lapse elim- 
inated or lapse-eliminable actions including and T^ built from spatial first principles 
using S0(3) spinors. Obtain 7i and treat its propagation exhaustively to obtain constraint 
algebras. Does the universal null cone result still hold ? Is Einstein-Dirac theory in any 
sense picked out? How does the thin sandwich conjecture for Einstein-Dirac theory be- 
have? On coupling a 1-form field, do these results hold for Einstein-Maxwell-Dirac theory? 
On coupling K 1-form fields, do they hold for Einstein- Yang~Mills~Dirac theories such as 
the Einstein-Standard Model? There is also the issue of whether conformal gravity can 
accommodate spin-^ fermions. 

It is worth considering whether any of my ideas for generalizing the TSA extend to 
canonical supergravity |1041 114H I295| llOfll llflU I2n6j . or to the minimal supersymmetric 
standard model This could be seen as a robustness test for our ideas and possibly lead to a 
new formulation of supergravity, or as a precaution of looking if our way of seeing the world 
is compatible with physics that could conceivably soon become observationally established 
(see 1.3.3). The supergravity constraint algebra is not known well enough^^ to comment 
whether the new supersymmetric constraint arises as an integrability condition for Ti.. 
Note however that Teitelboim was able to treat 5^ as arising from the square root of Ti. j35Uj : 
however this means that the bracket of and its conjugate gives so it is questionable 
whether the supergravity 7i retains all of the primary importance of the GR 7i. 

Finally, given the competition from |38|l6ll9] and this thesis it would be interesting to see 
whether the HKT or Kouletsis formulations can be made to accommodate spin-^ fermions, 
and also to refine Teitelboim's GR-matter postulates to the level of these formulations' pure 
GR postulates. 

VII More on TSA matter schemes 

In this chapter, I use the most general spacetime (rather than TSA) single 1-form ansatz. 
I then use the arguments of VI. 2 to pass to TSA formulations where possible. I carefully 
show how the TSA does permit a formulation of massive 1-form fields. This completes the 
dispelling of the possibility that the TSA has something to say about the origin of mass. 
This and further more complicated examples show that electromagnetism is not particularly 
being picked out by the TSA's relational principles, thus also dispelling any "hints at partial 
unification" . I also use the single 1-form example to smooth out the treatment of lapse- 
eliminability in VI. I finally consider in the light of my examples what the TSA has to 
say about the Principle of Equivalence (POE). I make much use below of the following 
mathematics. 

More on multipliers and cyclic coordinates 

I am dealing with fields, so spatial derivatives may enter the multiplier and cyclic equations 
1)421 I43|) rendering them p.d.e's. Sometimes this can be avoided through integration by 
parts. 

^^The most recent confirmation that we are aware of tliat this work remains incomplete is on p. 96 of 



In some cases, elimination from a multiplier equation is not possible. This occurs for L 
linear in the multiplier (so the multiplier equation does not contain the multiplier itself), for 
L homogeneous in the multiplier (so the multiplier equation contains the multiplier as a pure 
factor), or if the multiplier equation, be it p.d.e or algebraic, is insoluble or not explicitly 
soluble for the multiplier. The insoluble case would mean the system is inconsistent. It may 
also not be possible to eliminate using the cyclic equation 1)431) . This is more delicate since 
it is accompanied by passage to the Routhian. It is not possible to do this if L is linear in 
the cyclic velocity so that (|43p is independent of it, or insoluble or not explicitly soluble. 
Finally, note that some of the above features depend on the choice of variables. 

1 Update of TSA with a single 1-form 

1.1 A means of including Proca theory 

Note that whether a theory can be cast into TSA form can only be treated formalism by 
formalism. I explain here how to obtain a formalism in which Proca theory is allowed. The 
tricks used do not suffice to put all other theories I considered into TSA form. Thus the 
TSA retains some selectivity, although I demonstrate here via including Proca and various 
other theories that the TSA is less selective than previously assumed. I tie this selectivity 
to the FOE in VII.2. 

Of what use is the inclusion of Proca theory? It is then easy to see how to include massive 
Yang-Mills theory as a phenomenological theory of what weak bosons look like today, and 
Proca theory itself appears phenomenologically e.g in superconductivity. Whereas these 
applications are quite peripheral, it is nevertheless reassuring that one need not abandon 
the TSA to do phenomenology. 

Here is another way of looking at electromagnetism. The 'accident' method of VI. 1.2 
'lets go' of the constraint; fortunately it is 'caught again because it arises as an integrability, 
but one would not generally expect this to be the case. One could rather avoid the tilt by 
redeEning variables according to A± — > Aq = —aA±. Then one never 'lets go' of the 



Do Proca theory just like the above. Whereas this was previously objected to on p 158, 
notice that my observation on p 115-6 also holds {Aq can be taken to be a BM velocity 
if one so wishes as this only changes the equations weakly), which removes the objection. 
Thus the following is a TSA formulation of Einstein-Proca theory 



Also note that the Proca constraint is second-class. It then makes no sense by definition 
to work immediately with constraint propagation. Rather, the way to proceed is to obtain 
the Proca constraint, use it on the other constraints to eliminate Aq and then one has 
constraints which close. 

Why did we not hit on Proca theory before? Before we were doing 'discover and encode' 
as regards non-gravitational auxiliaries. This does not work here, ostensibly because the 
Proca constraint is second-class, while 'discover and encode' is a procedure whereby gauge 
theory (associated with first class constraints ^182]) is emergent. I require rather the more 
broad-minded approach in which all the auxiliaries are treated on the same footing by 
being present in the action from the outset. This works here by starting with actions for 
3-geometries together with one scalar and one 1-form matter fields. To have Proca theory, 
that scalar then turns out to be the above auxiliary. 

Similarly, gauge theory breaking down near singularities (IV. 1.4.3) is actually a permis- 
sible choice rather than an inescapable necessity. For, if one chooses to adopt the action 
resulting from the 'discover and encode' TSA formulation, then gauge theory breaks down, 
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while if one chooses to start from the outset with scalars alongside gauge variables then it 
does not break down. 



1.2 Further nonuniqueness examples 

First, choose to assume that there is no fundamental underlying theory so that local flat 
spacetime naive renormalizability makes sense. This puts a stringent bound on how many 
terms can be in the 1-form ansatz. 

The ansatz is = C'^^^'^V bAaV dAc + C^^^^V bAaAdAc + mA^ + qA^ 

(706) 

(the other combinations are total derivatives or zero by symmetry-antisymmetry). Note 
that I already split the first term in VI. 2. 4. The second term has in effect only one piece 
bA'^g'^^V aAb (including use of integration by parts). The last two terms are trivial to 
decompose. 

Using (|(i79H(i82'|) . the second term's tilt contribution vanishes by parts. Suppose the 
first term is present but not the second. Then the action is TSA-castable if Ci = —C2 and 
C3 = either by the A± multiplier equation 'accident' or by the Aq. If the second term is 
also present the first of these options is blocked because A± is then not a multiplier, but the 
Aq option survives thanks to additional integration by parts. The third and fourth terms 
are compatible with both options. 

Thus I have found two classes of single 1-form theories which I can cast into TSA form: 

if = aV[A^B]V[^^^] + mA^ + qA"^ , (707) 

lf = lf + bA^VAA"^ . (708) 

The first is Proca theory ii q = 0. If g 7^ 0, its Aq formulation has a Lagrangian density 
of type 

^l^ = Vha^A + ^ + ^^ , (709) 
so the a-multiplier equation is Aa'^ — Ba^ — 3C = 0, so by the quadratic formula. 
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(710) 



Compared to = a (A + ■^) which gives L = 2\/AB, the above is far more complicated [an 
example of my case 2) in VI. 1.4] but nevertheless a valid TSA presentation. Thus A^-theory 
was excluded by BFO on simplicity grounds rather than for fundamental reasons. 

For a = 0, the second class coupled to GR is linear as regards a (because it is in terms 

of A±) and thus gives a TSA theory of form L = 2V AB+D 

(711) 

where D is the linear kinetic term. This is similar in layout to the TSA formulation with 
spin-^ fermions, except that here tilt elimination does not happen to also eliminate the 
derivative coupling terms. 

Next, choose to assume there is a fundamental underlying theory rendering naive renor- 
malizability irrelevant. Are there then any other sorts of TSA theories with a single 1-form? 

Clearly adjoining any polynomial in to the spacetime Maxwell Lagrangian will do, 
and the TSA forms just get nastier. A messy Cardano formula is required if some amount of 
A^ term is furthermore present. If terms as high as A^^ are present, despite being algebraic 
the a-multiplier equation is not generally explicitly exactly soluble for a by Galois' well- 
known result. 



I also considered Born-Infeld theory. The easiest non-Maxwell Born-Infeld theory 
[L = (F o F)'^] also leads to the TSA form (jTlUj) . while the Born-Infeld theory particular to 
string theory also gives a a-multiplier equation not generally explicitly exactly soluble for 
a. 

Thus the TSA in fact admits a broad range of single 1-form theories. 

2 TSA and the principle of equivalence 

Curved spacetime matter field equations are locally Lorentzian if they contain no worse than 
Christoffel symbols [by applying ©]. The gravitational field equations are given a special 
separate status in the POE ('all the laws of physics bar gravity'). However, derivatives 
of Christoffel symbols, be they from double derivatives or straightforward curvature terms 
multiplied by matter factors, cannot be eliminated likewise and are thus POE-violating 
terms. 

Let me translate this to the level of the Lagrangians I am working with. If the Lagrangian 
may be cast as functionally-independent of Christoffel symbols, its field equations clearly will 
not inherit any, so the POE is satisfied. If the Lagrangian is a function of the Christoffel 
symbols, then by the use of integration by parts in each Christoffel symbol's variation, 
generally derivatives will appear acting on the cofactor Christoffel symbols, leading to POE- 
violating field equations. A clear exception is when the Lagrangian is a linear function of the 
Christoffel symbols. Lagrangians unavoidably already containing matter-coupled Christoffel 
symbol derivatives lead to POE-violating field equations. 

The first 1-form Lagrangian above contains no Christoffel symbols by antisymmetry (as 
do Yang-Mills theory and the various bosonic gauge theories). Linearity gives a guarantee 
of protection to Dirac theory, in a different way from the fortunate rearrangement in VI: 
despite being derivative-coupled this behaves according to the POE by this linearity. This 
means also holds for my second class of 1-form theories above. On the other hand, the 
excluded 1-form theories have Lagrangians nonlinear in the Christoffel symbols. Thus the 
TSA and the POE are acting in a similar way as regards the selection of admissible theories. 
I conjecture that (possibly subjected to some restrictions) the TSA leads to the POE. 

Crudely, tilt and derivative coupling come together in the spacetime split, and tilt tends 
to prevent TSA formulability. Crudely, tilt and derivative coupling originate in spacetime 
Christoffel symbol terms, which are POE-violating. At a finer level, 1) I know tilt and 
derivative coupling need not always arise together by judicious construction otherwise (my 
second class of 1-form theory example). This could potentially cause discrepancies be- 
tween TSA formulability and obedience of the POE from derivative-coupled but untilted 
examples. 2) Christoffel-linear actions are not POE-violating; moreover in the examples 
considered [Dirac theory, the standard interacting theories related to Dirac theory, and the- 
ory (|7U8|) ] this coincides with unexpected TSA formulability. 2) also overrules the given 
example of 1) from becoming a counterexample to the conjecture. 

There is one limitation I am aware of within the examples I considered. I showed 
that Brans-Dicke (BD) theory, whose Lagrangian contains Christoffel symbol derivatives 
multiplied by matter terms (the e~^R term), happens to have derivative coupling but no 
tilt. Hence this is another example of 1), but now BD theory is TSA formulable but 
POE-violating. Thus one should first classify POE-violating Lagrangians into e.g ones 
which merely containing covariant derivatives and ones which contain matter coupled to 
the curvature scalar, and then have a conjecture only about the former. BD theory was 
in fact the first theory considered in the TSA in connection with the POE (see j38j and 
now my conformally-untransformed working in VI. 1.4). But then I found all those other 
theories in VI and above have TSA fomulability and the POE occurring together. I suggest 
a further systematic search for (counter) examples should be carried out: including many 
1-forms, higher derivative gravity terms, torsion. 



2.1 On the origin of gauge theory 



Traditionally, gauge theory arises in flat spacetime by gauging a la Weyl (1.1.7). One 
postulates Lorentz-, gauge- and parity-invariance in finding the dynamics of the gauge 
fields themselves. Observation then requires some of the gauge theories to be broken a 
la Higgs. However I emphasize instead that gravitation may be doing some of this for us. 
Non-Maxwellian curl leading derivative terms in flat spacetime are just as Lorentz-invariant 
as Maxwellian curl ones, but accepting that we live in a curved GR spacetime, the former 
could not locally arise in the first place since they are POE violators. Thus both gauge 
theory and its broken form may be seen as arising from GR rather than separate gauge and 
gauge-breaking postulates. 

This viewpoint does not just relate to how TSA geometry-matter results come about. 
I find Teitelboim also came across it: "our efforts for preserving path independence have 
led us to gauge invariance" |,S48j . But, as the previous paragraph indicates, I also see it 
plainly in the unsplit spacetime formulation of GR, which makes me suspect that this point 
is older and has been forgotten. 

One issue is whether anything new could be inspired by this point of view. For example, 
might theories along the lines of my second class of 1-form theory, which obeys the POE via 
Christoffel-linearity rather than absence of Christoffel symbols through the Maxwellian curl 
leading derivatives, also be present in nature? This might underly cosmological mysteries 
or future particle physics experiments. N.B whether this is overruled on other grounds does 
not affect the viability of the above viewpoint on the origin of gauge theory. 



VIII Toward a quantum TSA? 

This chapter describes the present state of work toward quantization, which underlies a 
fair amount of preceding material. Recollect that spacetime and space are suggestive of 
different quantization approaches. 

1 Barbour's suggestion for quantization 

Barbour favours the timeless naive Schrodinger Interpretation (NSI) of quantum gravity.^^ 

^'*This includes the POE holding perfectly; the argument is unafltected by replacing this part of GR by 
the POE holding up to somewhat past our current stringent observational limit on POE violation. 

^^This idea was originally Hawking's |176| , and also used by him and Page |178| . and by Unruh and Wald 
|359| who coined its name. 



This is about the probabihty of this or that universe configuration occurring, based on the 



Whereas Barbour had hopes of rendering the GR configuration space tractable in a geodesic 
principle formulation, I have shown in VI. 1.3 that the GR configuration space curve study 
is a lot nastier than determining Riemannian geodesies (and conformal gravity retains at 
least some aspects of this nastiness). 

Let me illustrate with simple toys that the geodesic principle was a good conceptual idea. 
For a 3-body system, the configuration space is the Triangle Land, the space of triangles 
Tj. Here typical NSI-tractable issues are asking what P(T equilateral) or P(T isosceles) are 
[compare Hawking and Page asking what P (large fiat universe) and P (inflation) are |178j ]. I 
expect the configuration space metric would often play an important role in answering these. 
Gergely has studied the relevant geometry |151j . Detailed comparison with Superspace has 
not been done (and as Triangle Land has Superspace-like features such as stratification and 
singular barriers, this may be an interesting project). For Minisuperspace, Misner wrote 
j272j that he eventually came to understand that the considering the configuration space 
metric (the minisupermetric) was often crucial at both a classical and a quantum level (but 
beware the zeros!). 

My above criticism should be taken as a lack of extendibility of insight depriving one 
of tools, but not a fault. But the NSI should be surveyed for faults. Barbour and I think 
that its 'non-normalizability leading to relative probabilities only' feature is not a fault but 
a limitation (be it technical or actually physically-realized), so I do not worry about this 
here. But I do see the following as faults 236, 201 . 

Fault 1 No questions of becoming, such as what is P (isosceles evolves toward equilateral) 
or P(Bianchi cosmologies isotropize toward FLRW cosmologies), are answerable. 
Fault 2 In the NSI, the WDE equation is unaccounted for; if accounted for, there is 
incompatibility: the NSI i.p does not respect the WDE solution space. 

Barbour then proposes the 
Time Capsule Conjecture for TISE-type equations, 

1) there exist configurations with 'memories' or 'diaries' or 'fossils' or 'records' from which 
information about history might be inferrable in an actually timeless world. 

2) Furthermore, TISE-type equations might favour such configurations. 

3) This might be driven by the asymmetric shape of the configuration space. 

This conjecture is an extrapolation of the behaviour of a toy TISE, the one which 
Mott j27fij used to explain the occurrence of alpha particle tracks in bubble chambers. 
These track configurations are time capsules, and prima facie very unlikely configurations 
but nevertheless often visible in the laboratory. The target of the conjecture is the WDE 
viewed as a TISE. 

Note that 1) is compatible with the NSI, and is a novel means of cutting down fault 1, but 
2) clearly takes one outside the NSI because the WDE is invoked. Thus the NSI i.p needs to 
be replaced, and my main concern is whether this replacement by a WDE-compatible i.p is 
generally possible. I explain below that some of the theories in this thesis (viewed as toys), 
are rather nicer than GR as regards the i.p Problem and might just be related to a method 
of use in full quantum gravity. Also note that whereas elements of the above conjecture 
have begun to be substantiated for simple semiclassical and Minisuperspace WDE examples 
[Hl \ I17()j . these works use consistent histories rather than the NSI. 



l.p 
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2 Wheeler— DeWitt approach 



2.1 Strong gravities 



I must first fix the manifold structure of space. I promote hat, p to quantum operators 



and adopt tlie configuration representation hab{x) = hab{x) , ff^ = —i— — 

ohab{x) 



(713) 



I tlien choose the constraints to read — ;r^|^' >= 

dhab{x) 

(714) 

(ordered with hij to the left) which signify that |^ >= |^'(3-geometry) > alone, and the 
arbitrary-P^/^ strong gravity WDE 



|^'>=0. (715) 



Next one requires an i.p. As in the argument for GR (1.3.3.3), consider first a Schrodinger 
i.p in attempting to resolve the i.p Problem. Now, unlike GR, in the W < ^ cases this pro- 
vides an immediate resolution of the i.p Problem, since now a probabilistic interpretation 
is admitted, straight from the supermetric's positive-definiteness. Thus I have direct ana- 
logues of (E = 0) TISE's with constant potential. This i.p should be normalizable, and as 
an analogue of 'first quantization', it does not require a 'H — ' split of states. 

For > i the Schrodinger i.p does not admit a probabilistic interpretation, straight 
off from the supermetric's indefiniteness. So contemplate a Klein-Gordon i.p instead. Now 
unlike GR the potential is of a fixed sign, and thus directly analogous with the standard 
Klein-Gordon equation (or the tachyonic one, depending on the sign of the A 'mass'). Thus 
the Klein-Gordon type i.p ought to be normalizable. There is then the issue of the 'H — ' 
split of states. This follows in pure strong gravity theory because there is a Superspace 
Killing vector for all W , which is furthermore timelike for each W > ^. The use of this 
Killing vector, unlike in GR, is now acceptable since it adequately respects the W > ^ 
strong gravities' much simpler potential term. 

NB if i?-perturbations are to be much smaller than A, they cannot alter the sign of the 
potential. Thus there is hope for the above procedure to be applicable to GR and a range of 
scalar-tensor theories somewhat away from the extreme regime. This would closely parallel 
Isham and Pilati's perturbative quantization idea of expanding about strong gravity, in 
which a GR regime is recovered away from the singularity |2nn| I297| I298| I142j . At the 
very least, I have opened up the W ^ 1 extension of this study, of relevance to quantum 
scalar-tensor theories. But note that one may have to confront Pilati's full GR quantization 
stumbling block: obtaining a quantum expression for the i?-perturbation. 

2.2 Conformal gravity 

Again I fix the manifold structure of space. I promote hab, p""^ to quantum operators 
and adopt the position representation hab{x) = hab{x) ^ (j) = cj) , p"'''{x) = —i 



6hah{x) 

(716) 

S S 
I then choose the constraints to read Df,— — >= , habTi — r^l^ >= 

dhab{x) ohab[x) 

(717) 

(ordered with hij to the left), which signify that |^ >= |^ (conformal 3-geometry) > alone. 



and the conformal gravity WDE 



R(hij(x))-8- 



\qi >= 

Vh{x)(l)'^{x) 5hij{x)5hki{x) yl \ " ^ 4>{x ' 

(718) 

Now the choice of some W < ^ (e.g the W = above) permits the use of Schrodinger 
i.p, and makes quantum conformal gravity directly analogous to a E =0 TISE. 

Quantum conformal gravity has the additional issue of whether the LFE should be 
imposed classically or quantum-mechanically. I would opt to treat it classically, akin to a 
gauge-fixing condition. Tentatively, one has a timeless universe described by a TISE-like 
equation, nevertheless a preferred N emerges as the solution of the LFE, which (being 
monotonic) may be used to order configurations along curves in configuration space. The 
privileged foliation ties the further multiple choice aspect of the POT^^ and the sheaf 
aspect of the configuration space degeneracy to the existence and uniqueness properties of 
the elliptic LFE, for which many tools are likely to be available. The global POT aspect^^ 
also becomes tied to the properties of the LFE. These can be contemplated at a far more 
trivial level for strong conformal gravity. The foliation-dependence of quantization POT 
aspect is rendered irrelevant by the theory admitting a preferred foliation; this Problem 
rather becomes whether preferred slicing theories are viable descriptions of any part of our 
universe. The 'Hlip >= 7^ |[7i,Cy]||^ >= POT aspect remains unchallenged. 

Note that the configuration space CS is better behaved than Superspace in being 
geodesically-complete (c.f L2.8.1, L2.9.4.2). Also note that it remains unexplored whether 
the linear constraints can be dealt with classically in strong and conformal gravities; in 
the latter case this would presumably entail 'Conformal Superspace quantization' rather 
than 'Superspace quantization'. One interesting point is whether arbitrary-VK theories are 
perhaps more than toys, since varying-VF universes could be viable. This is inspired by 
scalar-tensor theory's tendency to GR-like late-universe behaviour. I speculate that some- 
thing similar might happen in some sort of CS+V theories, permitting the emergence of 
GR-like spacetime from a privileged-sliced early universe regime. It remains to be seen if 
such a regime could take W values with canonical theory markedly distinct from GR in 
quantum-cosmologically relevant regimes. Note that it follows from j216j that full scalar- 
tensor theory itself can posses unpleasant ultrahyperbolic supermetric regions in addition 
to hyperbolic supermetric regions, but not elliptic supermetric regions. 

I finally note that the above TISE's are unlike the many wave-equation-like toys habit- 
ually used in the quantum gravity literature. The corresponding toys would now rather be 
Helmholtz-type linear elliptic equations (see C.l). The potential sign then becomes impor- 
tant for good behaviour; the correct sign is clearly enforced in the gravitational theories due 
to Tl: the kinetic term's positivity determines the potential term's sign. Helmholtz-type 
equations on manifolds are known to have intricately-patterned shape-dependent solutions 
(see |252j for beautiful examples of patterns in compact universe models) . The superspace 
analogue of this might just help with 3) of Barbour's conjecture. 

3 Other approaches 



First for the old and the new. Following from the classical work in in.1.6 and V.2.2.3, it is 
likely that Wheeler's quantum-mechanically motivated thin sandwich scheme is realized in 
some of the alternative 3-space theories. Again a small i?-perturbation should not disrupt 
any pure strong gravity theory that works in this way. On the other hand, I have already 
argued that Ashtekar variables-type approaches do not work in IIL1.5. 



This is the possibihty that the quantum theory could depend on which choice of time function is made 
^^This is the possibihty that no time functions exist globally |236ll2Uf] . 



3.1 Internal time approach 



Second, I consider the search for an internal time, motivated by a (typically GR) spacetime 
ontology. Decompose the traditional variables into (/ij^^'* = h^shij, \^;p^^^ ,p) and perform 
the canonical transformation to {hfp^,TY',p^^-^ ,Vh)- Note that the definition of ry as the 
coordinate conjugate to ^/h is M/^-independent. Then treat 7i classically as an equation for 
the scale factor (now related to Vh). Having solved this, 7i may be replaced by the equation 

Vh{x) = Vh{x, h^f,TY;p^'^) . (719) 

Since ^/h is now a momentum, its corresponding coordinate ry (the York time) serves as a 
good internal time function, as quantizing and adopting a position representation yields a 

(720) 

Then the corresponding i.p solves the i.p Problem. Unfortunately in GR 7i becomes the 
nonlinear Lichnerowicz p.d.e, which is not exactly soluble so one does not know how to 
explicitly construct Vh{hf^^^{x),TY;p^^^ (x)) 



But for the strong gravity theories, W^ong is just an algebraic equation (|377jl . solved 
by dSZHI)- Thus, (fTTT^ is exphcitly 



^unit /jUnit „Ti j j^Tfci 



This gives the TDSE i _ I > = - / WV^it /^unit I ^' > , 

(722) 

where I have straightforwardly rescaled the internal time used. Note that the Hamiltonian 
(RHS operator) is of the form V D'^ for the functional Laplacian (containing a positive- 
definite pointwise supermetric which aids its well-definedness). 

Thus York time is a practically-realizable internal time in strong gravities. The W = 1 
case of this was alluded to by Pilati in |297j- He also observed that it is York time that 
crucially leads to the previous section's Klein-Gordon approach's Killing vector, and also 
that this conformal mathematics will break down if an /^-perturbation is included. Thus I 
cannot see the above internal time use surviving away from the singularity. 

For some conformal gravity theories, the use of York time is as for GR. It is a good 
internal time in principle but in each case one requires the exact solution of a Lichnerowicz- 
type equation for the scale factor in order to make progress. In others, like in conformal 
gravity and Kelleher's theory, York time is not a good internal time even in principle, since 
it is frozen. Also, if A*" is emergent, one has less use for another internal time candidate r. 



B Higher-dimensional spacetime? 



This largely separate Part makes a start |15| I16[ |Hj on applying 3 + 1 split ideas to the 
study of higher-d spacetimes with large extra dimensions. Recollect from 1.3.3.4 that these 
possibly play a role in string theory phenomenology. They involve most of the physics being 
confined or closely-bound to lower-d braneworlds surrounded by a higher-d bulk spacetime. 
Some models studied are directly relevant to string theory and M-theory: Hofava-Witten 
theory jl87j , heterotic string phenomenology j2()()j and the related ekpyrotic scenario j213j . 
However, many models studied [Tl 13191 ITHl 11601 13071 l332j are in fact formulated within 
the framework of (higher-d) general relativity (GR). Among these are: 

1) the second Randall-Sundrum scenario |3U7j in which the graviton is tightly-bound to the 
brane by the curvature due to the warping of the bulk metric, which is pure anti de Sitter 
(AdS). 

2) A more general scheme of Shiromizu, Maeda and Sasaki (SMS) (3321, in which the 4-d 
Einstein field equations (EFE's) are replaced by 4-d "braneworld EFE's" (BEFE's), which 
are not closed since there is a 'dark energy' Weyl tensor term, knowledge of which requires 
solving also for the bulk. The perceived interpretational difficulties due to this constitute 
the 'Weyl Problem'. The other main distinctive feature of SMS's BEFE's is the presence 
of a term quadratic in the braneworld energy-momentum, which arises from the junction 
condition used [Ml EHl 12021 ■ 

Two important questions arise in consideration of such models. 
Question 1: how should such models be built and interpreted consistently within the 
framework of the 5-d EFE's? This would require a careful underlying choice of conceptually- 
clear general framework, in the sense we discuss below. 

Question 2: what exactly is the connection between such models and the underlying 
string or M theory? More precisely, to what extent can the agreements or otherwise of 
predictions of such models with observations be taken as support for (or disagreement 
with) such theories? We concentrate on question 1 here, making a comparative study of the 
general schemes that have been employed in the literature in order to construct bulks which 
surround thin matter sheets such as branes or domain walls. Two broad schemes have been 
proposed to construct bulks: (3, 1; 1) constructions |i82i i3ST[ l362j starting from information 
on a (3, 1) spacetime hypersurface (usually taken to be the brane we hypothetically live 
on), and (4, 0; -1) constructions [3331 1277] starting by the construction of data on a (4, 0) 
spatial hypersurface. However, we first emphasize that one should grasp the fundamental 
arguments and results before specializing to the thin matter sheet models. 

We begin by arguing in B.1.1 against (n, 1; 1) constructions. In B. 1.2-3 we add to the 
arguments in 1.2.3.2 and 1.2.9.2 against the usefulness of the Campbell-Magaard embedding 
result both as a theoretical pretext and as a constructor of higher-dimensional spacetimes. 
Having argued for the superiority of York's and not Magaard's data procedure, we investi- 
gate the extent to which this is adaptable to s = 1, e = 1 (B.1.4). Other data construction 
techniques are considered in B.1.5. We further consolidate the viewpoint against 
(n, 1; 1) schemes by arguing in B.1.6 against a suggested virtue of these: their use to remove 
singularities ^302,, .155, . 

We then introduce thin matter sheets in B.2, and study the (3, 1; 1) schemes with thin 
matter sheets, recollecting the derivation of the junction conditions (B.2.1), and showing 
how the SMS formulation (B.2. 2) may be reformulated in a large number of ways using geo- 
metrical identities that interchange which terms are present in the braneworld EFE's. This 
is illustrated by the formulation in B.2. 3 which directly parallels the GR CP formulation 
and thus makes no explicit use of the Weyl term, and by formulations in B.2. 4 in which the 
quadratic term has been re-expressed entirely in terms of derivatives off the brane. These 
formulations are used to clarify a number of aspects of the 'Weyl Problem', in particular to 



argue for the study of the full brane-bulk system. Further aspects of such formulations are 
discussed in B.2.5. The implications of material in B.l in the case of thin matter sheets, 
are discussed in B.2.6-7. 

In B.3, we continue the study of the (n, 0; -1) scheme favoured by our arguments, in 
the presence of both thin and thick (i.e finitely thin) matter sheets. We begin in B.3.1 by 
providing a hierarchy of very difficult general thin matter sheet problems in which the main 
difficulties stem from low differentiability and details about the asymptotics. Within this 
class of problems we identify how the far more specific scenarios currently studied emerge 
as more tractable cases, and thus identify which as-yet unjustified assumptions such studies 
entail. The IVP step, of use in the study of how braneworld black holes extend into the 
bulk (the 'pancake' versus 'cigar' debate HU)) involves less assumptions. Thus we restrict 
attention to the (n, 0) data construction problem, which we apply to thin matter sheets 
in B.3. 2 and more straightforwardly to thick matter sheets in B.3. 3. We then conclude as 
regards question 1. 

1 Sideways problem in absence of thin matter sheets 

With the advent of schemes to construct bulks [HS OHSl EZZl EHU EZ[ EMI , the old questions 
regarding identification of sensible input and output for p.d.e systems (1.2.3.1-2, 1.2.8-10) 
become relevant in a new context. We principally use the arbitrary (r, s; e) ADM-type 
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Figure 14: When we simultaneously talk of several splits in Part B, we use the notation presented here. 
The white arrows represent the extrinsic curvatures associated with each of the embeddings. 

split H136|l . for which further specific notation is given in fig 14. Discussion of some of the 
literature |284| 1366] requires another metric split, the arbitrary (r, s; e) KK-type split 

^^^=U^V /... + 6a>VaJ =\-a^ h^' ) ■ ^^'^^ 

Note first that for KK theory itself, this split is (3, 1; 1), with hcd, o,c, $ held to be indepen- 
dent of the added dimension's coordinate z. It is this cylindricity condition that implicitly 
permits Oc to be interpreted as the classical electromagnetic potential. More recent general- 
izations of this scheme, called 'noncompact KK theory' |2841 136()] . have involved large extra 
dimensions in place of cylindricity, by permitting hcd, etc ,^ to depend on z. In some simple 
instances, Oc is held to be 0. But clearly for = and f3c = 0, the two splits H136|) and 
(|723|1 are identical. So there is already a vast literature on embeddings with a large extra 
dimension since this is what is usually treated in the GR CP. This fact is not exploited in 

^*This requires the identification a <-> More generally, a KK split is the inverse of an ADM split with 
the identifications a ^ /3i ^ — a^. 



[2841 \'6fj6[ . where rather the Campbell-Magaard result supposedly plays an important role. 
But from the far larger and more careful GR CP and IVP literature, one can easily infer 
that such a use is technically and conceptually flawed. 

Note that the above GR CP analogue suggests at a simple level that (3, 1; 1) methods 
such as that of SMS might be viewed as z-dynamics i.e "dynamics" with the coordinate 
of the new dimension, z, as IDV. Such a "dynamical" interpretation does not follow from 
KK theory, since there hcd was taken to be independent of z. Whereas a way of relating 
SMS's BEFE's and 'noncompact KK theory' has been pointed out 303 , it is surely far more 
revealing to relate the steps leading to SMS's BEFE's to the GR CP and IVP literature as 
we do in B.2. 

1.1 Bad or unexplored behaviour of sideways analogue of GR CP 

1.2.3.2 collects signature-independent results, including the Campbell-Magaard result. Al- 
though this is identified as an arbitrary-signature CP and IVP result and already heavily 
criticized in the Introduction, I now further criticize it because of recent suggestions of ap- 
plications of this result both in 'noncompact KK' theory |3121 128^1 1366j and in braneworld 
bulk construction |13[ I14[ I329j . Some of these criticisms hold more broadly against the 
other (3, 1; 1) constructions of bulks. My concern in this section is that the (r, 1; 1) coun- 
terparts of important GR CP results become bad and unexplored. Further poor behaviour 
of Magaard's sideways IVP is in B.1.2. I sum up against the Campbell result in B.1.3. 

Most of the important results of the GR CP (c.f 1.2.3.3) turn out to be entirely dependent 
on the lower-d signature s = 0. In other words, the choice of methods which properly respect 
the difference between space and time is absolutely crucial. 

The results in question arise due to the need in proper mathematical physics for well- 
posedness and not just existence and uniqueness; for CP's the domain of dependence (DOD) 
property is part of this, in order to have a sensible notion of causality. In our view causality 
can effectively be studied only in settings where the "independent dynamical variable" is a 
bona fide time. One reason for this is that given information on an arbitrarily-thin 
(3, 1) hypersurface,the (4, l)-d DOD is negligible because of this thinness (see fig 15). Other 
reasons are discussed below. 




Figure 15: Extending the 4-d DOD figure 6 [reproduced as a) above]. 

If one claims to know the 4-d DOD on-brane, one is precluding the influence of the extra dimension, thus 
making it largely redundant. The 5-d DOD of a thin sheet is small because of the thinness of the sheet. 
Thus predictability breaks down very quickly. This corresponds to the 4-d branewold EFE system not being 
closed. 

The (r, 1; 1) problem is a complicated sideways Cauchy problem. For these problems. 



well-posedness theorems remain notably undeveloped. There is quite simply no estab- 
lished way to proceed. There is no known sideways {s = 1) analogue of Leray's theorem. 
There are no function spaces known to be appropriate for sideways Cauchy problems. We 
can explain however why the applicability of Sobolev spaces to the GR CP does not carry 
over to sideways Cauchy problems. First there is no sideways notion of DOD to make the 
construction. Second even if we assume the higher-d DEC holds, it would not give an in- 
equality because the perpendicular vector in now spacelike (fig 16). Third, we obtain a 
difference of squares rather than the sums in (|165() . so the equivalent of the energy method's 
use of Sobolev norms is simply of no use to control the "evolution" given the data. 
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Figure 16: The bucket-shaped construction of Fig 7, and its meaningless sideways counterpart. 

1.2 Extra difficulty with sideways Campbell— Magaard 

The (r, 1; 1) Campbell-Magaard method encounters the following difficulty. The use of the 
strip < xi < rj as "evolutionary" data is generally invalidated by the information leak 
construction in fig 17, unless one has had the luck to construct a full global data set. But 
this would require the data construction to encounter no zeros nor to run into asymptotic 
difficulties. Thus, in fact signature does play a role even for the Campbell-Magaard result. 

1.3 Arguments against Campbell's theorem: conclusion 

After the above criticism, it is appropriate to conclude here about the Campbell-Magaard 
result. Although one can still state a form of Campbell-Magaard in these terms (about 
constructing an embedding for a small region to be determined outside which all prior 
knowledge of the metric is forsaken), this only permits a rigorous theorem at the price 
of exposing its application to building bulks as weak and undesirable. The result is also 
weak because of technical inappropriateness as described in 1.2.3.2, and because of lack 
of further supporting results (1.2.3.3 and B.1.2). Its application to construct physically- 
meaningful embeddings of known 4-d worlds is furthermore rendered questionable because 
it is so readily generalizable to all manners of matter-containing bulks, among which there 
are insufficient criteria for discrimination. 

Some of these poor features are Magaard-specific, and can be avoided by realizing 
Campbell's theorem otherwise. My Campbell-method 2 realization in 1.2.9.2.3 is at least 
for small regions known beforehand. My Campbell-irreducible method realization in 1.2.9.5 

^^The few simple results known for flat spacetime sideways wave equation problems are collected in [2]; 
these results might serve as a starting point for the study of the much more complicated nonlinear sideways 
"GR CP" system. So little is known about ultrahyperbolic equations that we deem it not sensible to talk 
about {r, s; e) procedures at present for s>lors = l, e = —1. We argue that s = 1 e = 1 is hard enough! 
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Figure 17: a) n is a (r, 1) (i.e timelike) hypersurface containing a data strip T built by the Magaard 
method from prescribed 'data for the data' on a timelike x\ = 0. The region Q. is that part of the (r + 1, 1) 
spacetime supposed to be entirely controlled by T i.e for which the Campbell method can validly provide the 
evolution. Typically T will not cover all of 11 in the a;i-direction, so there will be nearby points like p outside 
T. Consider the future light-cone with apex p. No matter how thin in the z-direction one considers Q. to 
be, it is always pierced by causal curves such as 7 on or in the light-cone, by taking 7 to be at a sufficiently 
slender angle 4> to H. Thus information can leak into from elsewhere than P, which is a contradiction. 
Therefore, in parts at least, Q. is arbitrarily thin in the z-direction. One can envisage that sometimes there 
will be enough points like p that there is no region Q, at all. 

b) Suppose = is spacelike and is a section of the entire hypersurface 11 and T' is the data strip built 
from it by the Magaard method. There is now no room for points like p\ However, typically T' will not cover 
all of n in the a;i-direction, so there will be nearby points like q in the causal past J~{P') not in P' . No 
matter how thin one considers fl to be, it is pierced by causal curves 7' from q which do not pass through 
P' , by taking these to be at a sufficiently slender angle </>'. So again, in parts at least, fl is arbitrarily thin 
and sometimes there will be enough points like q that there is no region at all. Small pieces of timelike 
hypersurfaces need not hold useful information for hyperbolic-type systems! 

is at least for a system that is everywhere vahd albeit the existence and uniqueness may 
break down in some regions, and is based on adequately covariant mathematics. Thus these 
superior forms of Campbell's theorem are 'about as badly behaved as the thin sandwich'. 
But a strong theorem would be for all regions, at least in many well-defined and substantial 
cases, like in York's work. A strong theorem would be for adequate function spaces and 
supported by further well-posedness theorems. 

About its application to 'noncompact KK theory', subject to the above restrictions, it is 
indeed always possible to embed small pieces of spacetime with a given metric into vacuum 
spacetime with one extra dimension. It is then claimed that the Campbell-Magaard theorem 
provides existence for the theory's Rab = equation to cover geometrization of any kind 
of matter [d(5& . just like KK theory is a geometrization of electromagnetic matter. As the 
Campbell-Magaard theorem relies entirely on the analytic functions which are inappropriate 
for relativistic theories, one must doubt that it is capable of providing support for any 
theory. Also, as the Campbell-Magaard theorem also holds (trivially) for whichever other 
choice of analytic functional form of higher-dimensional energy-momentum tensor, it in no 
way picks out vacuum spacetimes. Furthermore, this approach considers only 1-component 
('induced') matter; counting degrees of freedom shows that it cannot be extended to many 
important cases of fundamental matter. Whereas allowing for more extra dimensions could 
improve similar situations |366| 1382] . uniEcation requires geometrization of the fundamental 
matter laws themselves, whilst this 'induced matter' approach only geometrizes solutions 
of the EFE's coupled to matter of unspecified field dependence. ^'''^ 

100 Pqj- clarity, compare KK theory proper, in which the electromagnetic potential and Maxwell's equations 
are geometrized, as the frz portion of the 5-metric and the Grz equations of the KK split respectively, 
along with a scalar field. 'Noncompact KK theory' could geometrize a more general vector field than the 
electromagnetic potential in addition to the generalization of the KK scalar field that is all that is usually 



Finally I note that the Campbell-Magaard theorem does not appear to actually be used 
in any of its claimed applications to constructions: the messy Magaard part of the prescrip- 
tion is simply ignored (and even replaced by what turn out to be rudiments of the conformal 
method, to which I next turn). Unlike what is claimed in j329j and elsewhere, without such 
a specific prescription, all one is doing is applying standard embedding mathematics, and 
not some old but little- known result to modern scenarios. 

1.4 What parts of the conformal IVP method survive? 

Having argued much in their favour in 1.2.9, to what extent conformal methods can be 
adapted both to heuristic and to general constructions for (r, 1; e) data ? 

I treated the choice of the n-d generally-covariant, conformally- invariant trace-tracefree 
and TT-TL splits, and the subsequent decoupling of the Gauss and Codazzi equations in 
1.2.9.3, in a manifestly signature-independent fashion. The defocussing property of the 
underlying choices of slicing is signature-independent. 

But the study of the usual (3, 0; -1) Lichnerowicz equation has been specifically ellip- 
tic. Investigation of whether the s = 1 'wave Lichnerowicz equation' has good existence 
and uniqueness properties could be interesting. The natural setting for this is as a Cauchy 
problem (for even the 2-d wave equation is ill-posed as a Dirichlet problem). Assuming 
that there exists a Cauchy surface in the (3, 1) spacetime sense, one can attempt forward 
and backward evolution to produce a global data set (assuming also that the decoupled 
procedure for finding -fi'p^ also yields a global solution). Whilst this 'wave Lichnerowicz' 
procedure for the data could conceivably produce global data sets in some subcases and 
thus avoid the onset of information leak difficulties, the other difficulties described in B.1.2 
remain, since the next stage is still to be a sideways Cauchy problem. For example, ulti- 
mately the 'wave Lichnerowicz' problem would inherit the function space difficulty of the 
sideways CP that is to follow it. So we do not think the 'wave Lichnerowicz equation' is 
likely to be suitable as a general method. We favour instead the (4, 0; -1) approach in B.3. 

That said, the 'wave Lichnerowicz equation' may still serve as the basis of a useful 
heuristic method. We consider this in B.2.6. Note also that the outline of the carefully 
thought-out simplifications and tricks for the Lichnerowicz and Codazzi equations listed in 
C.2.2 hold regardless of signature. Considerations of ji = 0, cancellation of Lichnerowicz 
equation terms, conformal flatness, maximality, and of the 'Bowen-York' system of n -|- 1 
flat-space linear equations are all available. 

Also the study of the CMC LFE has been specifically elliptic, so the theory of existence 
of CMC slicings j33()| IHT] is likely to be signature-dependent. 

1.5 Other methods and formulations 

We note that the thin sandwich procedure of L2.8, L2.9.1 is (r, s; e)-independent. Thus 
we pose the thin sandwich conjecture for general (r, s; e). This is also considered for 
thin matter sheets in B.2.6. This is worth a try because of the concern with the sideways 
Cauchy problem being a bad prescription. Note that this suggestion signifies that the lower- 
d spacetime is a carrier of information about the extra dimension, which is now spacelike 
rather than a time. Many of the aspects of the Problem of Time will arise again in this 
context. 

considered. Now, this vector field is no known vector field of nature for its field equations does not in general 
correspond to that of any known vector field (although clearly the field equation contains the inhomogeneous 
vacuum Maxwell equations as a subcase since KK theory would be included in this way within 'noncompact 
KK theory'). For sure, this vector field (or even more obviously the scalar field) is not capable of being a 
simultaneous geometrization of the individual vector fields of the standard model. If one were to treat these 
fields collectively, one throws away all that is gained by keeping their classical identities separate: the results 
of Weinberg-Salam theory and QCD that are obtained then by quantization. 



The conformal thin sandwich scalings are again signature- independent, giving me an- 
other possible bulk construction formulation. Again however, the usual conformal thin 
sandwich mathematics is elliptic, so I do not know if this idea will work in the new sideways 
context. 

A (3, 1; 1) Hamiltonian formulation is not provided because although it is insensitive 
to e |23Uj (just a few sign changes), and to dimensional increase (just a few traces), it 
is sensitive to s through the intimate involvement of the space of geometries. Whilst the 
usual Hamiltonian formulation is based on the space of Riemannian geometries (Superspace) 
|376| 1106] , the space of semi- Riemannian geometries is reported to be not even Hausdorff 

1.6 Applying (r, 1; 1) methods to remove singularities 

We have argued above in favour of (4, 0; -1) schemes over (3, 1; 1) schemes. However, it 
is often suggested that (3, 1; 1) schemes may be useful since in certain particular examples 
(3, 1) spacetime singularities are 'removed' by embedding into nonsingular (4, 1) space- 
times j3U21 1155] . Whereas embeddings are undoubtedly useful tools in the study of (3, 1) 
spacetimes (for the purpose of algebraic classification |159| I223j ) , we ask what is the status 
of their use to remove cosmological singularities. Is it mathematically rigorous, generally 
applicable and physically meaningful? 

By 'singular' we mean geodesically-incomplete jl33j . In the study of singularities, time- 
like and null geodesies (t.g's and n.g's) are the only objects of fundamental importance 
since they are the curves privileged by the free travel of massive and massless particles 
respectively. The other objects associated with the study of singularities arise rather as a 
matter of convenience. 

Such objects include the expansion and shear of geodesic congruences, and curvature 
scalars. We focus first on these (B. 1.6.1), explaining how they arise in the study of singu- 
larities and then demonstrating how the embedding and embedded versions of these objects 
can be very different (B.1.6.2). We then discuss the more important issue of geodesic incom- 
pleteness in the context of embeddings (B.1.6.3). An example is used to illustrate some of 
these points and others (B.1.6.4). A lack of rigour in singularity-removal claims is uncovered 
in (B.1.6.5). We return to the case with thin matter sheets in B.2.7. 

1.6.1 Secondary objects in the study of singularities 

For (g, l)-d cosmology, smooth congruences of past-directed normal t.g's with normalized 
tangents denoted by are considered. The decomposition 

Bab = DaUb = -hab + (Tab + ^ab (724) 

provides the expansion 9 = h^-^Bab and shear aab = '^(^fe) • twist oj^b = B^ab] is zero for 
the normal congruences considered here, in which case Bab is an extrinsic curvature 9ab- The 
corresponding normal Raychaudhuri equation [c.f H159|) ] can now be considered. Although 
for ^0 > this would usually mean that a caustic develops, under certain global conditions 
a contradiction about the existence of conjugate points arises. Singularity theorems are 
thus obtained; in a cosmological context the simplest^'^^ of these is (Hawking, theorem 1 of 

There are a number of other cosmological singularity theorems |177l 136^ not only because some have 
weaker eissumptions, but also because one wants to be able to treat a number of pathological cases. One 
such is the Milne universe: although this naively looks like a cosmology with its focusing of geodesies normal 
to t =const as t — > 0, it is merely a region of Minkowski spacetime. So one would not want to include the 
Milne universe among the singular spacetimes. The way out of this is to demand that physically-meaningful 
cosmologies contain matter. This makes sense because the problematic physics that may be associated 
with geodesies focusing is the possible pile-up of matter travelling along these geodesies leading to infinite 



For globally-hyperbolic (3, 1) GR spacetimes obeying the strong energy condition and such 
that ^ = C > on some smooth (spacelike) Cauchy surface S, then no past-directed timelike 
curve from S can have greater length than 

By the definition of Cauchy surface, all past-directed t.g's are among these curves and are 
thus incomplete, so the spacetime is singular. Such theorems generalize to {q, 1) spacetimes 
(provided that these obey analogous energy conditions) in the obvious way since Raychaud- 
huri's equation clearly behaves in the same way for all q and the global part of the arguments 
uses topological space methods that do not care about dimension. 

It must be noted that the singularity theorems are about existence whilst saying nothing 
about the nature of the singularity. Ellis and Schmidt began to classify singularities accord- 
ing to their properties |133j . a difficult study which may never be completed 187^. Below 
we consider only the most elementary type of genuine singularities: curvature singularities, 
for which at least one spacetime curvature scalar such as R or RabR^^ blows up. 



1.6.2 Relating the embedded and embedding secondary objects 



Here we explain how knowledge of Gauss' hypersurface geometry renders it unsurprising 
that some singular spacetimes are 'embeddable' in nonsingular ones. This is because the 
behaviour of the higher- and lower-d objects used in the study of singularities clearly need 
not be related. As a first example, consider expansion. In the normal case, 

a ^ ^^"/S , ^ 1 dhab 



from which follows 9 = — ©ii • 

n 

(726) 

So a blowup in 9 need not imply a blowup in 0. Thus the (4, 1) spacetime perspective 
on focusing of geodesies can be completely different from the perspective on some (3, 1) 
hypersurface. Fig 18 shows how one's (3, 1) notion of expansion generally corresponds to 
expansion and shear from the perspective of an embedding (4, 1) spacetime. In particular, 
what look like caustics or singularities in the (3, 1) spacetime could well correspond to no 
such feature in some surrounding (4, 1) spacetime. 

As a second example, consider curvature scalars, in particular the Ricci scalar. Pre- 
liminarily, Gauss' outstanding theorem is a clear indication that extrinsic curvature can 
compensate for differences between higher- and lower-d intrinsic curvatures. Our simple 
idea is to consider the implications for rigorous embedding mathematics of the generaliza- 
tion of this to the case where lower-d curvature scalars become infinite. From the form 



(UMl) for the (3, 1; 1) Gauss constraint, 2p = 2R^^-R = -K'^-K^oK^-R 

(727) 

clearly - at least for some (3, 1) worlds which have Ricci-scalar curvature singularities 
R — > oo - there will be surrounding (4, 1) worlds in which R (and the 5-d p) remain finite. 
For it could be that o K"^ — jK^ — > — oo suffices to compensate for R — > oo. Thus 
it could be shear, expansion or both that dominate the compensation for R — > oo. If this 
involves \K\ — > oo, it means a (4, 0) caustic forms in the (4, 1) spacetime corresponding 
to the (3, 1) spacetime's singularity. Yet this is not the only nonsingular possibility; if 
R — > —oo it may be the shear that compensates, including cases in which the blowup is 
pure shear. 

densities. One succeeds in not including the Milne universe by use of singularity theorems hinging on 
the 'timelike genericity condition' Rabcdu'^u'' 7^ at least one point along each timelike geodesic, since the 
Milne universe is flat. 



flat FLRW surfaces z = constant 



b) 




Figure 18: a) A simple illustration of how 4-d expansion may be interpreted as being mainly due to shear 
in a 5-d embedding manifold. 

b) Diagram of the embedding of flat FLRW universes into Minkowski spacetime in the standard 5-d 
Minkowski coordinates {T, Xi, X2, X3, Z). The curved surfaces are the FLRW spacetimes. As one ap- 
proaches T = (corresponding to i = in FLRW coordinates), each of these surfaces becomes tangent to 
the light cone (characteristics of Minkowski). The foliation by these surfaces also becomes bad here because 
the surfaces intersect. Note also that the FLRW geodesies move within each of the surfaces whereas the 
Minkowski geodesies clearly pierce these surfaces. Thus the 4-d and 5-d geodesies in this example are not 
the same. 

Finally, clearly the higher-d singularity theorems hold so even if one were to succeed 
in excising singularities from a lower-d model, one would typically expect singularities to 
occur elsewhere in the resultant highcr-d models. The nature of these higher-d singularities 
may not be the same as that of the excised lower-dimensional ones. So, interestingly, by 
allowing extra dimensions, one would be even less certain of the character of singularities. 

1.6.3 embeddings and geodesic incompleteness 

The fundamental importance of geodesies is problematic for embeddings since in general the 
(3, l)-d geodesies are not included among those of an embedding (4, l)-d spacetime. There 
is then the dilemma of which of these sets of curves should be the physically privileged ones 
and thus be the set of curves whose extendibility is in question. 

If one wishes to postulate (4, l)-d GR, in addition to the (4, l)-d EFE's one must 
surely require that the matter follows the (4, l)-d geodesies (c.f 1.1.5). Then the issue of 
extendibility of the original (3, l)-d geodesies becomes irrelevant since they are unprivileged 
curves in the (4, l)-d spacetime. One would rather replace them with (4, l)-d geodesies, 
which might be viewed as unsatisfactory from a GR ontology. Note that this is at the 
price of altering what the effective (3, l)-d physics is: often it will be of affine-metric type 
and thus in principle distinguishable from ordinary (3, l)-d GR physics. This may contend 
with the recovered (3, l)-d physics being as close to the usual (3, l)-d physics as is widely 
claimed in the braneworld literature. 

Suppose instead that one postulates that both (3, 1)- and (4, l)-d geodesies play a 
part in braneworld models, as the paths of brane-confined and bulk-traversable matter (or 
gravity waves) respectively. It is then not clear at all what is meant by a singularity - 



exactly which curves are supposed to be incomplete? 

Finally, (3, 1) incompleteness might correspond to a (3, 1) hypersurface that becomes 
tangent to the characteristics of the bulk equations in which case t.g's and n.g's are being 
forced to exit that (3, 1) hypersurface. Also, the (3, l)-d geodesies might be of (4, l)-d 
geodesies which are extendible only by replacing a piece of the original foliation with a new 
one that extends into what was originally regarded as the extra dimension. 



1.6.4 Embedding fiat FLRW in Minlcowslci 



The following example illustrates many of the above points and leads us to further com- 
ments. Consider a (4, 1) spacetime with metric (see e.g j3U2p 



.,2+2 



2 2 g^t^ 



gAB = diag{gu, e„^, g^^) = diag ( -z^, ti z'^-'^ Sap, ,^ " ) , q > l- (728) 



This is simple to treat because: 

1) If we foliate it with constant z hyper surf aces, a portion of each z = const hypersurface 

has induced on it a flat FLRW cosmology. In particular, the coordinates have been chosen 

1 

so that the z = 1 hypersurface is the FLRW cosmology with scale factor t « . We restrict 
attention to g < 3 due to the DEC. For q ^ 2 there is a (3, 1) Ricci scalar curvature 
singularity (g = 2 is the radiation universe, whence R = S = 0). 

2) The (4, 1) spacetime is in fact Minkowski. 

First, foliating the flat FLRW cosmology with constant t surfaces, ^ = ^ — > cxd as 
t — > 0, so there is (3, 0; -1) focusing as the Big Bang is approached. Furthermore, only 
focusing occurs: 9'^^ = 0. Next foliate the (4, 1) spacetime with constant t surfaces. The 
(4, 1) spacetime contains many FLRW worlds on constant-z surfaces. Build a congruence 
by collecting the FLRW geodesies on each z = const slice. Then the (4, 0; -1) expansion 
e = ^ also blows up as t — > oo, but there is also a blowup of the corresponding 
(4, 0; -1) shear: 

= l^diag(-e,^, 3<7..) . (729) 

Also, whilst still both 9 — > oo and — > oo in this case, we know that the former viewed 
from within the z = 1 (3, 1) hypersurface corresponds to a genuine (3, 1) singularity, whilst 
the latter in (4, 1) would be a mere caustic. 

Second, consider the z = 1 slice of the (4, 1) spacetime. Here the (3, 1; 1) expansion 
and shear are 

K = i-^ , K^^ = -^diag(3, tU^s). (730) 

Thus for the physical range, both blow up as t — > 0. So the spacetime includes a z = 0, 
t = point, at which there is a (4, 0) caustic. These blow-ups combine in K^^ oK"^ — to 
cancel R for all values of q; for q = 2 the shear and expansion contributions exactly cancel 
each other. The unphysical g = 4 case is an example of a pure shear blowup. 

Although the congruence two paragraphs up is built to naturally include the geodesies of 
all the included FLRW universes, these turn out not to be the (4, 1) geodesies (nor pieces of 
them). Rather, by considering the coordinate transformation between (|728() and standard 
(4, l)-d Minkowski coordinates, it is easy to show that the Minkowski geodesies pierce the 
z = constant surfaces that are the FLRW universes (more significantly in the early universe 
than today). As t — > the z = constant surfaces approach the null cone so the example is 
an illustration of the foliating (3, 1) hypersurfaces becoming tangent to the characteristics 
at the point of interest. Also the foliation breaks down as t — > because the family of 
hypersurfaces of constant z intersect at t = 0. All these points are illustrated in Fig 18b). 

Finally, the example cannot be taken to be typical by the (4, 1) version of genericity. 
Indeed, we know that not many (3, 1) spacetimes can be even locally embedded into 



(4, 1) Minkowski \22'6\ . The value of particular examples is limited to exhibiting possible 
behaviours. One would require this to be upgraded to the study of large classes of exam- 
ples to assess probable behaviours. Indeed, one would expect higher-d cosmological models 
to exhibit a range of behaviours distinct from that of the Big Bang (including nonsingu- 
larness). This follows from knowledge of particular examples (from the algebraic study) 
of inhomogeneous cosmologies in standard GR ^224. We especially note that if highly- 
symmetric cosmologies are haphazardly embedded, one would often expect the resulting 
higher-d models to be less symmetric i.e more anisotropic and inhomogeneous than the 
original lower-d models [as naively suggested by the appearance of (4, 0; -1) shear from 
models containing no (3, 0; -1) shear]. So the resulting higher-d models are expected occa- 
sionally to exhibit more unusual behaviour than the lower-d models from which they arose, 
and this includes the possibility of nonsingular cosmologies. Indeed, we would expect that 
an increase of dimension increases the variety of possible behaviours in inhomogeneous cos- 
mology. But the bottom line in the study of 4-d and 5-d singularities is the same: one is 
ultimately interested in generic behaviour and this requires more sophisticated means of 
study than building individual particular examples. 

1.6.5 Nonrigorousness of singularity removal by embedding 

So, there are arguments against the use of embedding results toward making general state- 
ments about singularity removal. Some of our arguments against the use of the Campbell- 
Magaard result (some of which also hold against the use of further embedding theorems) are 
relevant to this context. In particular, given a spacetime, there are so many possible embed- 
dings. Were nature higher-d, why should it choose a particular nonsingular embedding out 
of an infinity of (nonsingular or singular) embeddings that mathematically exist? The more 
extra dimensions are present, the more severe this nonuniqueness is. This makes singularity 
removal by such embeddings physically-questionable. The Campbell-Magaard scheme, and 
other embedding schemes such as the 10-d Minkowski embedding theorem |159j ). are local. 
This localness includes the notion that the theorem is only applicable to a small region of 
the original manifold. We question whether this need include the approach to a singularity, 
since these are edges of the lower-d spacetime and thus have no neighbourhood. Singulari- 
ties are global features of spacetime. However even some of the global embedding theorems 
(which require very many extra dimensions) are stated for the analytic functions jl59j . But 
low differentiability may be typical in the approach to singularities 87^. It becomes much 
harder to make any general statements once one accepts that spacetime is not analytic! 

One may view at least some (3, 1) singularities as projective effects due to taking a badly- 
behaved foliation (for example a foliation that becomes tangent to the characteristics), 
manifested by the formation of caustics or pure shear blow-ups. The question is why 
there is confusion in reversing this projection procedure to embed (3, 1) singularities into 
nonsingular (4, 1) spacetimes. If an embedding of the (r, s; e) type such as those considered 
in this section is to be used, it is mandatory that the region of hypersurface to be embedded 
be entirely of one signature s (the Cauchy-Kovalevskaya theorem demands a nowhere- 
characteristic lower-d manifold). Also, from the lower-d perspective, the singularity cannot 
be included in the set of points on which data are prescribed because it is an edge of 
spacetime, not a point of spacetime. Were one to try and include it by providing data 'right 
up' to that edge, one would expect that the data would become badly-behaved e.g some 
components of K^a could be forced to be infinite. Under the various circumstances above, 
embedding theorems become inapplicable to reconstruct the higher-d spacetime. 



2 (r, 1; 1) methods with thin matter sheets 



The warpfactor split |3U7j 



is a simple subcase of the z-dynamics scheme, in which the metric is allowed to z-evolve 
only in its scale, away from the z = hypersurface where it is taken to be known. Then 

Qf^/'^ dlnW 
(fT371) leads to — = -f^''^^K which gives the equation — — = -(^K 

oz oz 

(732) 

for the warpfactor. For example, using the ansatz XrA = CfrA in (|732j) gives an exponential 
Randall-Sundrum type warpfactor. Whereas the split (|731|) does not cover very many 
cases, our scheme exhibits generalizations for it: to permit the whole metric to evolve and 
to recognize the gauge freedom in /3r, which should ideally be used to separate coordinate 
effects from true physics in the spirit of j336j . 

Full, overtly z-dynamical schemes are used in particular examples for domain walls j57j . 
braneworld black holes (such as for the pancake or cigar bulk horizon shape problem j82j ) 
and in models for braneworld stars |38H I362j . Whereas in the Randall-Sundrum model the 
higher- and lower-d cosmological constants balance out leaving vacuum (in fact Minkowski 
spacetime) on the brane, more generally a brane would consist of a thin sheet of matter 
- a junction. We study such [Shiromizu-Maeda-Sasaki (SMS)-type] braneworlds below,-^^^ 
starting first however with a careful recollection of where the underlying junction conditions 
come from. 

2.1 Origin of the junction conditions 

Assume that we have a {D - l)-d thin matter sheet in a D-d bulk with one extra spatial 
dimension (e = 1). Our discussion follows |273j most closely whilst keeping the unraised 
index positions of Israel's work |2n5j . 

Whereas the requirement of well-defined geometry dictates that the metric is continuous 
across the thin matter sheet yielding the junction condition (j.c) 

[frA]t = /r+A - fvA = , (733) 

discontinuities in certain derivatives of the metric are permissible. Consider then the 3 
projections of the Einstein tensor Gab- We use the e = 1 cases of the Codazzi and Gauss 
constraints (|149|) and (|148j) for Gr± and G±± respectively. For GrA, the following con- 
struction is used. 

One begins by writing down the contracted Gauss equation (|141j) and subtracts off ^/rA 
times the doubly-contracted Gauss equation - ()144() : 

^rA - R±r±A= RvA - KKta + Kv^K^k 

R f 4- K . . f^. R f-r.^ 4- K^-KoK f y '■^^1 

--2 JrA + -K±±/rA - 2"-'rA + ^ ■ 

The following steps are then applied. 

Step 1: The Ricci equation ()14U() is used to remove all the i?_Lr±A- 

Step 2: The contracted Ricci equation H143|) is used to remove all the -R_l_l. Thus 



Rta 

_R 
2 



^/rA J + 



hAr Aah^ 

frA V -2"/rA J + 2 /rA 



RrA \ - KKrA + Kt^K AA 



(735) 



^This section is based on |15| and |16| . 



This is then rearranged to form the "GR CP" geometrical identity (|147l) 

(736) 

hm - 

Then performing ^ y GAsdz one obtains the j.c's 

[Gxjl = 0, [Gr±]l = 0, (737) 

[GTA]t = [KrA - kAK]t . (738) 

The derivation of this last equation makes use of normal coordinates (in which case the 
hypersurface derivative 6^ becomes the normal derivative ^) and the rearrangement to the 
'Israel' geometrical identity 

Gta = Gta + KKta + 2Kr^KAA + ^'^5°^ /rA + |- {KrA - IfaK) (739) 

2 oz 

via the definition of extrinsic curvature ()137() to form the complete normal derivative 

^{KvA-fvAK) . 
oz 

Step 3: One then further assumes that the (4, l)-d EFE's hold. If one then 

lim - 

additionally uses the thin matter sheet energy-momentum Yab = q / T^sdz, 



one obtains the j.c's = Y±± , = Yr± , 



(740) 



[KrA]t = (vrA - ^^fr^) (^41) 
(performing a trace-reversal to obtain the last equation). 

2.2 SMS's braneworld EFE's 

We next recall the method SMS use to obtain their BEFE's j332j . They begin by forming 
the (3, l)-d Einstein tensor GrA just like we obtain (|734j) above. SMS then apply three 
steps to this equation. 

Step SI: Using the definition of the Weyl tensor, -R_Lr±A is replaced by the electric part of 
the Weyl tensor, ErA = W±r±A and extra terms built from the projections of Rab- 
Step S2 ( = Step 3 of the above subsection): The (4, l)-d EFE's are then assumed, which 
permits one to exchange all remaining projections of Rab for (4, l)-d energy-momentum 
terms. Only when this is carried out does (|734() become a system of field equations rather 
than of geometrical identities. We refer to the field equations at this stage as "timelike 
hypersurface EFE's" (THEFE's),^''^ as opposed to the braneworld EFE's which arise at the 
next stage. 

Step S3: A special subcase of THEFE's are braneworld EFE's (BEFE's), which are obtained 
in normal coordinates by choosing the (thin) braneworld energy-momentum tensor ansatz 

Tab = YabS{z) - Kqab , Yab = (Tab - X/ab) , Tabz"^ = , (742) 



losrpj^jg choice of name reflects their superficial resemblance to the (3, l)-d EFE's, although as discussed 
below, this resemblance is only superficial. 



where TpA is the energy-momentum of the matter confined to the brane. This is a special- 
ization due to the specific presence of (3, 1) and (4, 1) cosmological constants A and A, and 
by A being the only bulk contribution. See B.2.5 for more about this. 

One then adopts the j.c's ()733|) . H737() . and H741|) with the additional supposition of Z2 



,.104 



symmetry: 
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(743) 

where the (4, l)-d gravitational constant G5 has been made explicit. Then SMS's BEFE's 



read 



/~f T SMS I r\i 

Gta = -i^rA +V: 
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(744) 

where QrA(T) and LrA(^) are the terms quadratic in, and linear together with zeroth 
order in T^a respectively, given by 
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Trn^^^A + ( — 7^ — 1 /rA 



(745) 



Lff = + T^') + f ^^^^ • ^^^^^ 

As opposed to the (3, l)-d EFE's, SMS's BEFE's are not closed since they contain the 
unspecified electric part of the Weyl tensor -EpA- Although it also contains 15 equations, 
the SMS BEFE-Gauss-Codazzi system is not equivalent to the (4, l)-d EFE's: indeed SMS 
write down further third-order equations for the "evolution" away from the timelike brane 
of ErA, by use of the z-derivative of the contracted Gauss equation H141|) . Bianchi identities 
and the Ricci equation H14U|) . This then involves the magnetic part^^^ of the Weyl tensor 
Baat = W±rAA, the "evolution" of which follows from further Bianchi identities. This full 
brane-bulk SMS system is then closed. 

Step 4: In practice, however, instead of the difficult treatment of this third-order system, 
other practitioners have often worked on the SMS BEFE's alone. This involves the ad 
hoc prescription of the functional form of E^a (sometimes taken to be zero^'^^). In fact 
-ErA is often first decomposed according to a standard procedure |2641 \2&I\ . Because the 
original functional form is unknown, the functional forms of each of the parts defined by the 
decomposition is also unknown. Some of these parts are set equal to zero whereas other 
parts are restricted by unjustified but convenient ansatze. In particular, the anisotropic 
stress part PpA is sometimes set to (see e.g the references 7 in |^), or otherwise restricted 
(see the references 8 in |15p. The radiative perfect fluid part is often kept, but is then 
argued to be small (despite containing an unknown factor) in the circumstances arising in 
the inflationary |264j and perturbative j262| treatments. Having dealt with E^a in one of 
the above ways, the form (|745|) of Qp^^ is then often taken to be uniquely defined and the 
starting-point of many works on brane cosmology j53j i25Jl IMl 13211 12U91 17^ 12631 1189j . 

However, SMS's procedure is far from unique. It turns out that there are many refor- 
mulations of the BEFE arising from geometrical identities. Each has a distinct split of the 
non-Einsteinian BEFE terms into 'bulk' and 'brane' terms. Whereas all these formulations 
are clearly equivalent, their use helps clarify how to interpret SMS's braneworld. Were 

i04rp]^g sign difference between II74H and SMS's paper is due to our use of the opposite sign convention 
in the definition of extrinsic curvature. We compensate for this in subsequent formulae by also defining 
Kab = -K^i rather than +K^^. 

^°^As the 5-d alternating tensor has 5 indices, one has two choices for the number of indices in what is to 
be taken to be the definition of the magnetic Weyl tensor. The above is the 3-indexed definition; the other 
possible definition is H^'^ = e'^^^Brsn. 

^'^^It is zero for example in the references 5 of I15| . as well as in the restricted setting of an AdS bulk. 

lOTrpj^g unknowns are sometimes kept, for example the 'Weyl charge' for black holes in |94|. 



one to truncate the 'bulk' terms in each case (in direct analogy with the usual practice of 
throwing away the Weyl term), then the BEFE's obtained in each case would generally be 
inequivalent. 

2.3 Ambiguity in the formulation of the BEFE's 

The Weyl term in SMS's BEFE's has been the subject of much mystery. Our aim is not 
to argue about what functional form the Weyl term may take (e.g zero everywhere) in 
particular solutions of the system. Rather we show that different formulations of the full 
brane-bulk system exist which - although completely equivalent ~ lead to very different 
splits of the BEFE's non-Einsteinian terms into 'brane' and 'bulk' terms. Some of these 
reformulations have no explicit Weyl terms present in their BEFE's. We first remove some 
misconceptions as to how SMS's procedure leads to a BEFE containing a Weyl term. Does 
it have anything to do with the modelling of braneworld scenarios? No, for the Weyl term 
is already in the SMS THEFE before the braneworld energy-momentum ansatz is invoked. 
Furthermore, all the procedures used in SMS's method are independent of signature and 
dimension. Thus this issue of a Weyl term must have already arisen long ago in the study 
of the GR CP. So why is there no manifest Weyl term piece in the GR CP formulation of 
the EFE's? The answer is simple, and has already been used in 1.2.2 and B.2.2: in the "GR 
CP" and Israel procedures, one uses the Ricci equation il4f)\) to remove the R±r±A terna. 
If there is no early use of the Ricci equation, one is left with E-pA in the THEFE's, which 
requires later use of the Ricci equation to "evolve" it. 

So there is a choice as to whether one formulates the BEFE's with or without an explicit 
Weyl term. In the usual treatment of the split of the EFE's (1.2.2), one does not use an 
explicit Weyl term. In the (4, l)-d case, this gives a well-understood system of 15 p.d.e's 
in the variables /fa- The option of using an explicit Weyl term gives a considerably larger, 
more complicated system of p.d.e's with variables frA, -E-rA and -BpAA- 

Although this SMS system looks somewhat like the threading formulation (1.2.12.1), it 
does not appear to be tied to the idea of deliberate incompleteness. The use of this idea 
in the threading formulation is clearly tied to signature-specific physical reasons which do 
not carry over to the signature relevant to SMS's equations. We also note that some other 
third-order reformulations of the (3, 0; —1) split of the EFE's are sometimes used to seek to 
cast the EFE's into hyperbolic forms that manifestly have theorems associated with them 
jl45j . Whereas this is precisely the sort of result that is spoiled by considering instead a 
sideways split it serves to illustrate that what at first seems a 'mere reformulation' of a 
set of equations can in fact be used to prove highly nontrivial theorems. So, whereas similar 
complicated formulations have been used elsewhere in the GR literature, SMS's unstated 
motivation to have a complicated formulation does not coincide with the motivation else- 
where in the literature. Below we bring attention to many reformulations of SMS's system, 
so we ask: what is the motivation for the original SMS formulation? Should the use of some 
simpler second-order formulation be preferred? Is SMS's formulation or any other third- 
or second-order formulation singled out by good behaviour, either in general or for some 
particular application? 

Also, from first principles the SMS procedure to obtain their BEFE is quite complicated. 
For, since they use the j.c obtained by the Israel procedure, their procedure actually entails 
beginning with the whole Israel procedure (Steps 1 to 3 of Sec 4.1), and then choosing to 
reintroduce Rr±A± and -R_l_l by reverse application of Steps 1 and 2. This is followed by the 
Weyl rearrangement (Step SI), the use of the EFE's (Step S2) and the substitution of the 
j.c into the extrinsic curvature terms in the braneworld ansatz (Step S3). However, despite 
being complicated, all is well with SMS's scheme since any BEFE's obtained by other such 

^"^Having argued that the second-order sideways "GR CP" is not known to be well-posed, we should add 
that it is unlikely that third-order formulations are likely to be better-behaved in this respect. 



combinations of careful procedures will always be equivalent because the different steps are 
related by geometrical identities. 

Step 4 however is not an instance of being careful as it is a truncation. Our first 
point is that whereas in SMS's formulation the non-Einsteinian terms in the BEFE might 
be regarded as a bulk-like -EpA ^-nd a term quadratic in the brane energy-momentum, in 
other formulations the content of these two terms can be mixed up. In general, BEFE's 
contain a group of non-Einsteinian 'bulk' terms we denote by B^a (which include both 
Weyl terms and normal derivatives of objects such as the extrinsic curvature), and a group 
of non-Einsteinian 'brane' terms that depend on the brane energy-momentum. Thus any 
temptation to discard the Weyl term in the SMS formulation (on the grounds that it involves 
the unknown bulk over which one has no control) should be seen in the light that if one 
considered instead a reformulation, then there would be a similar temptation to discard the 
corresponding 'bulk' term, which would generally lead to something other than the Weyl 
term being discarded. Thus for each formulation, the corresponding truncation of the 'bulk 
term' would result in inequivalent residual 'braneworld physics'. This is because there are 
geometrical identities that relate 'bulk' and 'brane' terms, so that the splits mentioned above 
are highly non-unique and thus not true splits at all. We take this as a clear indication 
that any such truncations should be avoided in general. Instead, the full system must be 
studied. 

Our second point is that each possible bulk spacetime may contain some hypersurface on 
which a given -BpA vanishes. Then if one identifies this hypersurface with the position of the 
brane, one has a solution of the full brane-bulk system and not a truncation. For example, 
in any conformally-flat bulk, by definition Wabcd = and therefore i?p^^ = —E^a = on 
all hyper surf aces. Thus any of these could be identified as a brane to form a genuine (rather 
than truncated) —ErA = braneworld. From this, we can see that the SMS formulation 
is particularly well adapted for the study of conformally-flat bulks such as pure AdS. This 
motivates SMS's formulation as regards this common application. However, also consider 
repeating the above procedure with some -BpA 7^ —E^a- This would correspond to a genuine 
(rather than truncated) braneworld model with distinct braneworld physics from that given 
by SMS's particular quadratic term. Note that given a model with some B^a = 0, the 
BEFE formulation for which i?rA is the bulk term is particularly well adapted for the study 
of that model. Thus different formulations may facilitate the study of braneworlds with 
different braneworld physics. In the context of conformally-flat spacetimes, it is probably 
true that the —Eta = braneworlds outnumber the braneworlds for which any other (or 
even all other) B^a = since these other conditions are not automatically satisfied on 
all embedded hyper surf aces. Rather, each of these other conditions constitutes a difficult 
geometrical problem, somewhat reminiscent of the question of which spacetimes contain a 
maximal or CMC slice |267j . However, generic spacetimes are not conformally-flat. For a 
generic spacetime, we see no difference between the status of the condition '— E'rA = on 
some hypersurface' and the condition 'any other particular i?rA = on some hypersurface'. 
Because braneworlds constructed in each of these cases have a different residual quadratic 
term and thus a propensity to have distinct braneworld physics, and because we do not 
know how frequently each of these cases occur, we question whether anything inferred 
from conformally-flat models with the SMS quadratic term need be typical of the full SMS 
brane-bulk system. Confirmation of this would require study of the full range of difficult 
geometrical problems B^a = 0, and the construction of concrete examples of non-SMS 
quadratic term braneworld models together with the assessment of whether their braneworld 
physics is conceptually and observationally acceptable. 

For the moment we study what is the available range of reformulations and thus of i?rA- 
To convince the reader that such reformulations exist, we provide a first example before 
listing all the steps which are available for reformulating the BEFE's. 

Assume we do not perform all the steps implicit within SMS's work but rather just the 



Israel steps to obtain the j.c and then use it in the field equation (|7ci9|) that gave rise to it 
(as done in ^Sj), or (as done below) use it in the "GR CP" field equation following from 
l|736|) . In other words, why not apply the braneworld ansatz to e.g the Israel or "GR CP" 
formulations rather than to the SMS formulation? In the "GR CP" case we then obtain 

GrA = ^rA + QrA + ^rA , with (747) 



Qfa = [SeTr^TnA - ISTTta + {9ToT + T^)frA] (748) 
^rA = %TX- 2A2)/rA + G^iTrA - (A + A) /fa] (749) 

This example serves to illustrate that choosing to use a different formulation can cause 
the 'brane' quadratic term QrAiT) to be different. Also note that this formulation makes 
no explicit use of the Weyl term. Thus this BEFE, along with the Gauss and Codazzi 
constraints, forms a small second-order system, in contrast with the much larger third- 
order SMS system. 



Now we further study the list of steps ^| which may be applied in the construction of 
BEFE's. 

Steps SI and 3 together mean that the Weyl 'bulk' term Eya is equivalent to the Riemann 
'bulk' term together with matter terms. This swap by itself involves no terms which are 
quadratic in the extrinsic curvature. 

Step 1 says that the Riemann 'bulk' term is equivalent to the hypersurface derivative of the 
extrinsic curvature together with a KrnK^A term. 

Steps SI and 3 together say that the hypersurface derivative of the trace of the extrinsic 
curvature is equivalent to a matter term together with a K o K term. 

Furthermore, one can use both Steps 2-3 and Step 1, on arbitrary proportions 
(parameterized by fi and v) of R±r±A and of R±±: 



Gta = Gta - (1 + i^)R±r±A + (1 - f^)R±±frA 



+ - 

a 



+ KKta + l)Kr^KAu - —frA +i^--fijKo KfrA • (751) 

This introduces freedom in the coefficients of the KmK^A and K o K contributions to the 
quadratic term QrA{K) in the THEFE's. We next find further freedom in QrA by choice 
of the objects to be regarded as primary. 

We are free to choose a 'bulk' term described by hypersurface derivatives 5^ (which are 
partial derivatives ^ in normal coordinates) of objects related to the extrinsic curvature 
Kta by use of the metric tensor (including its inverse and determinant /). The underlying 
reason for doing this is that it is just as natural to treat such an object, rather than the 
extrinsic curvature itself, as primary (see below for examples). 

Upon careful consideration, there are three separate ways such objects can be related 
to the extrinsic curvature: raising indices, removing a portion of the trace by defining 
rjKrA = KrA — fJ^frA, and densitizing by defining ^Kya = x Kya- The hypersurface 
derivatives of these objects are related to those of the extrinsic curvature by 

SffKrA = hifmK^A) = frn^K^^ " ^aKrnK^A , (752) 



6f^KrA = Sf^K^^ + r](^5^KfrA-2aKKrA) , (753) 

S^KrA = {fr^'6^{f^'KrA) + 2a^iKKrA • (754) 
Further useful equations arise from the traces of these: 

5^K = f^SpKvA + 2aK o K , (755) 
= r^t/'^^'^^r^^rA - 2ar/(i^2 _ ^ „ K)] , r/ / i , (756) 

= {fr^H'pU^^K) + 2a6i^' , (757) 
where the S^K in ()756() and H757() has been obtained via 1)7551) . 

The fohowing examples of ^Kya illustrate that the use of such objects is entirely natural: 
\i^Kya is the Ky/s^ commonly used in the IVP literature, and o^^-^^rA appears in the guise 
of forming the complete normal derivative in the Israel procedure. Also, the "gravitational 

momenta" are pya = —i'^Kya- 

The above thorough consideration of possible 'bulk' terms permits all four THEFE terms 
homogeneously quadratic in the extrinsic curvature to be changed independently. One may 
think that we have a redundancy in providing 8 ways to change only 4 coefficients. However, 
one can afford then to lose some of the freedoms by making extra demands, of which we 
now provide four relevant examples. First, one could further demand that there is no Weyl 
term in the THEFE's (as discussed in B.2.4). Second, unequal densitization of K^a and K 
(i.e ^1 7^ C2) corresponds to interpreting the fundamental variable to be some densitization 
of the metric rather than the metric itself. Whereas this is again a common practice (for 
example the scale- free metric of the IVP literature is = /^"/pA in dimension n), the 
use of such an object as fundamental variable does appear to complicate the isolation of the 
(3, 1) Einstein tensor truly corresponding to this fundamental variable. Thus this option is 
not pursued in this thesis. Third, one may start by declaring that one is to use particular 
well-known primary objects (such as the "gravitational momenta") and still desire to be 
left with much freedom of formulation. Fourth, one could declare that one is to use the 
raised objects given by H755|) . in which case the further ability to change coefficients by use 
of ()753jl is lost, since moving a Kronecker delta rather than a metric through the derivative 
clearly generates no terms quadratic in the extrinsic curvature. 

2.4 Examples of formulations of BEFE's with no quadratic terms 

As a consequence of the above freedoms, there are many formulations in which all four 
coefficients vanish, and hence Qta{K) = 0. From this it follows that QrA{T) is zero [and it 
is easy to show that all instances of Qta{T) = follow from QrA{K) = 0]. Thus it suffices 
to seek for cases of THEFE's with QrA{K) = to obtain all cases of BEFE formulations 
that have no quadratic term. We now motivate these formulations and then choose to 
exhibit three that comply with some of the extra demands in the previous paragraph. 

The diversity of 'brane'-'bulk' splits ensures that truncations such as Step 4 produce all 
possible combinations of quadratic terms as residual 'braneworld physics'. Alternatively, one 
may suspect that there might be solutions to the full brane-bulk system that just happen 
to have a particular B^a = on some hypersurface which is then identified as a brane. 
We speculate that each of these situations will often lead to different answers to questions 
of physical interest. Whereas most FLRW perfect fluid models with equation of state 
P = (7 — l)p arising thus will be similar, differences will be more salient in models with more 
complicated equations of state, in perturbations about FLRW (as started in [2621 12481 1^ 



in the SMS-adapted case) and in anisotropic models (as started by |721 12(331 11891 15H] in the 
SMS-adapted case). These in turn constitute natural frameworks to seriously justify the 
late-time emergence of FLRW behaviour and the likelihood of inflation j91[ 13211 l2U9j as well 
as the study of singularities |SH1 EHl on the brane. We emphasize that, for a satisfactory 
study of whether any particular full brane-bulk (as opposed to truncated) case leads to any 
differences from the hitherto-studied —E-pA = case, one would require a full brane-bulk 
solution explicitly constructed to satisfy some BpA = on some hypersurface within a 
particular given bulk spacetime. Since we currently have no such example, our arguments 
currently only support the far simpler idea that truncation should be avoided. 

We illustrate that in different formulations, the truncation of the corresponding 'bulk' 
terms can lead to big differences in the residual 'braneworld physics', without any of the 
above lengthy calculations. We do this by formulating the 'bulk' part so that there is 
no corresponding Q term at all. Thus these truncations give the 'p' of standard FLRW 
cosmology rather than the 'p and p^' of brane cosmology |53| I264| I2(i2j . As a result whether 
we have a '/O and p^' brane cosmology depends on the choice of formulation. So we argue that 
since the SMS procedure followed by truncating the Weyl term is a hitherto unaccounted- 
for choice out of many possible procedures, then adopting the particular homogeneous 
quadratic term of SMS (often taken as the starting-point of brane cosmology) appears to be 
unjustified. Rather, we conclude that no particular truncation should be privileged as the 
act of truncation imprints undesirable arbitrariness into the study of the truncated system. 
Whereas the (3, l)-d trace of Er/\ is zero and thus might^'^^ phenomenologically look like 
pure radiation fluid to observers on the brane, other bulk characterizations would typically 
not look like a pure radiative fluid. This may open up phenomenological possibilities. 

Also, before further study of SMS's full (untruncated, third-order) system is undertaken, 
some of the reformulations along the lines suggested in B.2.3 might turn out to be more 
tractable. In particular those reformulations which fully eliminate R±r±A by the early use 
of the Ricci equation are already closed as second-order systems. These include the Israel 
formulation in 15 , the "GR CP" formulation H747H75(]|) . and our second and third examples 
below, which contain neither a Weyl term nor a quadratic term. 

For our first example, we take as the primary object the antidensitized extrinsic curva- 
ture K^f, = so that the 'bulk' term is (partly) a combination of this object's normal 
derivatives. The corresponding BEFE's are: 

GrA = Lta + BrA , (758) 



^rA = ^ + ^ (5f^x - t) frA , BrA = -2^rA + Vf - ^f^^frA 

(759) 

(where we have chosen to remove all projections of Rab by the EFE's). 

To derive this, take (|75H) in normal coordinates. Choose to convert all of the 

§ into /--^ by 

Gta = GrA - (1 + J^)R±r±A + (1 - f^)R±±frA + (^^~^^ + 

+KKrA + l)Kr^KMi - ^fvA + (^ + l^)K o KfrA • (760) 



^"^This is if one treats the geometric content of the Weyl tensor as an effective or 'induced' energy- 
momentum. People often also assume that this ought to behave like a perfect fluid, although Maartens' 
decomposition of ErA |262| . permits a more general treatment. 



Now choosing the primary object to be some densitized ^i^rA by (|754|) we have 
GrA = GrA - (1 + i^)R±r±A + (1 - ^)R±±frA + (-^^ + /^/^^^^ 



(761) 



so clearly /x = — ^ , = 1 and the antidensitization choice of weighting i = —\ ensure that 
Qva{K) = 

The following examples arose from asking if it is possible to find examples in which 
neither QrA{T) nor E^h. feature. We found the following BEFE's: 



GrA = -^rA + ^rA 



(762) 



1 / . 



1 (idK^ dK^T , ' 

77 1^2 57^^^ " ^^^"^^ 



(763) 



by considering as our primary object the densitized extrinsic curvature with one index 
raised, K 1, = y/hK"'h. 

We obtained these BEFE's by arguing as follows. In order for the BEFE's to contain 
no Weyl term, v is fixed 

to be —1. Then the only control over KrnK^^ is from raising by (|752() . It is easy to show 
that this raising must be applied to the whole ^^j^ in order for the coefficient of K^nK^A to 
be zero. Then using r]K^A does not change any terms quadratic in the extrinsic curvature. 
Also, use of distinct densities for K and K^a does not appear to make sense since both 
quantities are related to K^a by a single use of the inverse metric. Although all these 
restrictions make the outcome unlikely, the use of H754|) and (|757() alone suffices to obtain 
the above example: 



GrA = GrA + (1 - ^)R±±frA + f'^ 



d^KvA , dK ^. 

— ^ ^ JrA 

oz oz 



+(1 - 2i)KKrA + (2e/i - \)K^fvA + {\- f^)K o KfrA 



(764) 



which has no quadratic terms if /i = 1/2 and ^ = 1/2 ('densitization' weight). 

Another possibility is to replace ^K^a by rjK^ a- Although this does not immediately 



do anything about the quadratic terms, if we also convert a portion tt of into "''g/^ f 
we obtain 



GrA = GrA + (l-/u)i?±±/rA 
d^K^A 



dz 



+ {fi-r]) 



TT 



/^^ + (1 - vr: 



1 — 4?7 dz 



dz 



fvA 



+{l-2i)KKTA+ 



2^/2 



1 2-K7]{fI — T]) 

2 1 - 4r/ 



1 27rr/(/i-r/) 



which requires ^ = h, whereupon the two remaining equations become identical: 



1^ 



1 27rr/(/i — rj) 

2 ~ 1 - 4r] 



KoKfvA , 
(765) 

(766) 



which clearly has many solutions. A particularly neat one is to take rj = 1 so that the 
primary objects are 'gravitational momenta' and vr = 1 so that only two normal derivative 
terms appear in the 'bulk' term. Then ^ = ^ so the BEFE's read 

GrA = Lta + BrA , (767) 



Of course, it would make sense to particularly investigate the difficult geometrical prob- 
lem 'i?rA = on some embedded hypersurface' for such Bta = corresponding to no 
quadratic terms, since by the same arguments as above, such a model would be sure to give 
braneworld physics distinguishable from that hitherto studied. 



2.5 Two further comments about building SMS-type braneworlds 

First, so far we have talked in terms of the Yta = Tya — ^Jta split of the matter contribution 
to relate our work as clearly as possible to its predecessors in the literature. However, from 
the outset |,S;-i2j it was pointed out that this split is not unique. On these grounds we would 
prefer to work with the unique trace-tracefree split in which all the A/rA contributes to the 
trace part. The (4, 0) version of this split is used in B.3. 

Second, given a fixed type of bulk energy-momentum such as the Tab = of 'noncom- 
pact KK theory' or the Tab = ^9AB, then establishing an embedding requires the existence 
of a suitable compensatory characterization of the bulk geometry. The GR line of thought 
would be to only take results within such schemes seriously if they are robust to the ad- 
dition of bulk matter fields. Of course privileged choices of bulk could arise from further 
theoretical input. We argue below that the theoretical arguments behind some privileged 
choices in the literature for the bulk energy-momentum are not convincing enough to an- 
chor strongly credible physical predictions. One would rather require rigorous and general 
theoretical input following directly from some fundamental theory such as string theory. 

We now argue about the Tab = ^9AB choice of bulk (an example of which is pure AdS). 
This is clearly also always possible given the premises of a particular case of the generalized 
Campbell-Magaard result, but we have argued that this is not significant. We rather discuss 
the argument for pure AdS bulks from string theory. This is not generally justifiable since 
firstly, bulk gravitons are permitted so the bulk geometry would generically contain gravity 
waves. Secondly, bulk scalars ought to be permitted since they occur along with the graviton 
in the closed string spectrum 299j. The content of the closed string spectrum thus places 
interesting restrictions on bulk matter rather than completely abolishing it. From the 
perspective of 5-d GR, evidence for the stability of vacuum or AdS bulks (and of any 
resulting physical predictions) to the introduction of suitable bulk fields would constitute 
important necessary support for such models and their predictions. 

Finally note that use of arbitrary smooth bulk Tab does not affect the form of the 
junction conditions since only the thin matter sheet contribution to Tab enters these. The 
conclusion of our second point is that there is no good reason not to explore at least certain 
kinds of bulk matter in order to have a more general feel for how these thin matter sheet 
models behave j21L)j . 

2.6 Sideways York and thin-sandwich schemes with thin matter sheets 

Although (3, 1; 1) methods which build higher-d bulks about the privileged (3, 1) worlds 
by "z-dynamics" look tempting at the simplest level, the more advanced issues from B.l (of 



causality, and of well-posedness not being known) hold regardless of the presence of thin 
matter sheets. The issue of rough function spaces will become particularly relevant in the 
study of sufficiently general situations involving the evolution of thin matter sheets (see 
B.3.1). 

Beyond the objections in B.1.3, the Campbell-Magaard scheme is furthermore of limited 
use in models with thin matter sheets because the junction condition imposes restrictions 
on Kya which prevents these being subdivided into the knowns and unknowns of Magaard's 
method. 

We next consider the (3, 1) version of the York method applied on the thin matter 
sheet, but recall that even if this does provide data sets, one is next confronted with the 
difficulties of the (3, 1; 1) "evolution" scheme. To date (4, 1) worlds built from (3, 1) ones 
have relied on very simple specialized ansatze, such as 

A) z-symmetric surfaces K-pA = with known metric frA, whereupon the vacuum Codazzi 
equation is automatically satisfied and then i? = is required from the Gauss constraint. 

B) KrA = C frA with known Jya-, for example to obtain the Randall-Sundrum bulk |3U7j 
or slightly more general solutions JH]- Now the maximal subcase K = Q oi the CMC 
condition is a generalization of A) , whereas the full CMC condition itself is a generalization 
of B). Moreover, now the metric is to be treated as only known up to scale. Thus one would 
generally only know the full metric of each model's (3, 1) world once the 'wave Lichnerowicz 
equation' for the embedding of this world into the (4, 1) world is solved. So one loses the 
hold from the outset on whether each model will turn out to contain an interesting (3, 1) 
world. Nevertheless, some of the (3, 1) worlds will turn out to be of interest. Furthermore, 
one should question the sensibleness of any ideas involving the prescription of full (3, 1) 
metrics if the most general technique available fails to respect such a prescription. This point 
is more significant for (3, 1) data than for (3, 0) data because conformally-related metrics 
have different non-null geodesies. For (3, 0) data no physical significance is attached to 
spatial geodesies, but for (3, 1) there are timelike geodesies which are physically interpreted 
as paths of free motion of massive particles. So a (M, hab) spacetime which is conformally 
related to {M,hab) is different physically (for example one could violate energy conditions 
the other one does not violate). One can get out of this difficulty by either attaching no 
physical significance to one's inspired guesses for (M, hab) or by hoping for unobservably 
tiny deviations between the geodesic curves of the two geometries. 

In the specific case of thin matter sheets, by the j.c's. A) implies that YpA = 0, whilst B) 
implies that y is a hypersurface constant. The maximal condition implies that Y = whilst 
the CMC one implies that y is a hypersurface constant. So whereas the maximal and CMC 
ansatze are more general than A) and B) respectively, they are nevertheless restricted in this 
braneworld application. Notice, however, that a number of interesting cases are included: 
vacuum, radiative matter and electromagnetic matter are all among the Y = spacetimes. 

It is important to note that unlike the usual GR application, the choice of a hypersurface 
to be a brane is not a choice of slicing because localized energy-momentum is to be pinned 
on it. Almost all reslicings would fail to isolate this energy-momentum on a single slice. We 
know of no good reason why the brane should be CMC nor what value the CMC should 
take on it. However, at least this is a well-defined notion and it is substantially simpler to 
solve for than in general because of the decoupling of the Gauss and Codazzi constraints. 

As regards possible use of either thin or thick forms of the thin sandwich conjecture 
to treat branes, we first distinguish between thin sandwiches between 2 nearby branes and 
thin sandwiches which have a brane on one side and an undistinguished hypersurface on 
the other. One should be aware that non-intersection requirements may be different in 
these two cases, and also different from that of the original thin sandwich setting of 2 
unprivileged spacelike hyper surf aces. Second, one would have to take YpA as unknown 
until it can be deduced from the i^rA evaluated from the thin sandwich procedure. Finally 
we caution that thin sandwich schemes need not always exist. They require the "lapse" to 
be algebraically-eliminable, which for example is not the case for the analogous ^ of the 



KK split. 



2.7 (n, 1; 1) singularity removal and thin matter sheets 

First, there is less scope for it occurring along the lines of ()727() . For, as it is the energy- 
momentum on the brane, Ita is presumably finite for a nonsingular (4, 1) world so Kya. is 
finite and so cannot cause blowups in K"^ o — jK"^. However, blowups in this quantity 
can still occur if the inverse metric is badly-behaved (corresponding to / = 0). Note however 
that the application of the Cauchy-Kovalevskaya theorem to GR requires this not to be the 
case everywhere within the region of applicability. 

Second, one would of course require the investigation of more elaborate (e.g 'warped') 
embeddings than the example discussed above in order to investigate whether some embed- 
dings lead to 5-d geodesies that exit the brane. If the (3, l)-d geodesies on the brane are not 
included among those of the (4, l)-d bulk, it is not clear at all what is meant by 'singular' 
since one then would have to simultaneously consider some notion of extendibility for two 
congruences of privileged curves of different dimensionality. 

Under the decomposition H136|) . the geodesic equation ((T^ becomes 

+ f^A^x^x^ - 2K^AX^x^ = , (769) 

x^ + Kaj:x^x^ = . (770) 

if one upholds the use of normal coordinates. In order for ()769p to reduce in all cases to the 
(3, 1) geodesic equation on the brane, one requires x"*" = or K^a = everywhere on the 
brane. By (|770() . maintenance of i;-*- = along all geodesies is impossible unless Kas = 0. 
Thus in all cases JCas = is required on the brane. But this means that JCas^ = Kat.~ 
or equivalently Yas = i.e the absence of a brane. Thus in SMS-type braneworlds the 
(3, 1 ) geodesies need not be among the (4, 1 ) geodesies. 

One is trying to model confined matter. But the presence of brane-confined fields then 
means that brane-field particles fall off the brane if these follow (4, 1) geodesies, which 
cannot be a healthy state of affairs. If one rather postulates that these follow (3, 1) geodesies, 
then 

A) there is POE violation going on: bulk-traversable and brane-confined matter test-bodies 
undergo distinct free fall, which might have observable consequences. 

B) the on-brane causal structures due to brane-confined photons and due to bulk-traversable 
gravitons would generally be distinct. Thus the new, serious conceptual difficulties in B.1.6.3 
about the definition of singularities are indeed relevant to brane cosmology. 

3 (n, 0; —1) methods with thin matter sheets 
3.1 Hierarchy of problems and their difficulties 

We are here concerned with the study of (n -|- l)-d GR models with thin matter sheets using 
only well-studied, well-behaved mathematical techniques which also make good physical 
sense and are applicable to the full EFE's. In our view a good way of achieving this is via 
the (n, 0) IVP, followed where possible by the heavily-protected (n, 0; -1) CP. 

We provide a hierarchy of modelling assumptions together with associated difficulties, 
and argue that evolution is more problematic than data construction. We concentrate on the 
latter, making use of attractive features of the usual-signature York method in the new thin 
mater sheet setting. The differences between this setting and that of numerical relativity 
(c.f 1.2.9-11, App C) lead to the following difficulties. Firstly, certain thin matter sheet 
models require particularly rough mathematics. Secondly, they require novel asymptotics 
and boundary conditions (b.c's). 



Note that the increase in dimensionaUty itself does not affect much the data construc- 
tion nor the study of well-posedness and stabihty of the GR CP, although there is some 
dimensional dependence when the rougher function spaces come into use, due to the con- 
trast between the Hughes-Kato-Marsden theorem's requirement that the Sobolev class of 
the metric to be no rougher than H^~^^ (p 37) and that in all dimensions, the presence 
of thin matter sheets means that the metric is rougher than (the functions with con- 
tinuous second derivatives) because of the jump in the Riemann tensor at the sheet. This 
corresponds to the metric being too rough to belong to the Sobolev class H'^. Thus HKM's 
mathematics, the strongest used to date for both the GR IVP and CP [2^, is not generally 
powerful enough to deal with thin matter sheets. 

More powerful mathematics is thus desirable, and more may well appear over the next 
few years (see the program starting with j217j ). Realizations of part of the hierarchy of 
models below would be an interesting application for this sort of mathematics, which further 
motivates the study of increasingly rough Sobolev spaces, since the higher the dimension 
the greater the improvement to the established HKM mathematics is required for a full 
study of thin matter sheets. We must also mention that in spaces rougher than H'^ there is 
no guarantee of geodesic uniqueness |177j , so complete conventional physical sensibleness is 
generally lost no matter what rougher Sobolev class results are proved. This shortcoming is 
clearly dimension-independent (from the dimension-independence of the form of the geodesic 
equation) . 

The asymptotics difficulty (181) follows from supposing that 'AdS' bulks are desirable for 
string-inspired scenarios. The study of 'AdS' bulks by relativists would entail the study of 
asymptotically AdS bulks, to permit a more general study of disturbances close to the thin 
matter sheet. Whereas we will for the moment seek to avoid the function space difficulty, 
the asymptotics difficulty will remain in the 'tractable scenarios'. In the usual GR, the use 
of AdS asymptotics for the application of the York method to small-scale astrophysics was 
neglected because of the negligible effect of a cosmological constant on such small scales and 
because of DOD arguments. The braneworld application is however substantially different, 
so one might in this case have to study the York method with 'AdS asymptotics'. This 
could affect the tractability of the Lichnerowicz equation, and also whether the existence of 
the crucial maximal or CMC slices is as commonplace for these new spacetimes as it was 
for the (3, 1) GR ones. Furthermore, because of the interest in infinite planar branes, the 
asymptotics is directional, 'perpendicular' to the brane. However, in general there is no 
such notion of perpendicularity in GR. Whereas one can locally define geodesies and draw 
hypersurfaces perpendicular to them, this procedure in general eventually breaks down, for 
example due to (spatial) caustic formation. 

We start our hierarchy by envisaging the most general situation possible for thin matter 
sheets within the framework of GR. It is entirely legitimate to construct a (4, 0) initial data 
snapshot with whatever shape of thin matter sheet, but it is not legitimate to assume that 
any features of this are maintained over time unless it can be shown that the full evolution 
equations robustly maintain these features. Strictly, this is an initial-boundary problem, 
such as occurs for water waves or for the surface of a star. This is a very hard and quite 
new problem in GR |145| 114 jj . Thus the evolution step is particularly hard both in its full 
generality and in the justifiability of simplifications such as a fixed boundary. 

Consider a thin matter sheet in an asymmetric (not Z2 symmetric) bulk. One would 
ideally want to follow objects that crash into and possibly disrupt the thin matter sheet. 
One can then imagine asymmetric crashes which might punch though the thin matter 
sheet. The sheet could thicken or disperse with time. It could emit a significant amount of 
gravitational radiation, which could moreover include gravitational shock waves that spread 
out the H'^ character that initially pertained only to the thin matter sheet and not to the 
smooth surrounding bulk. That is, the H"^ character of a thin matter sheet data set J could 
typically spread to the whole of its causal future J^{J) (fig 19). 
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Fi gure 19: The spread of the H'^ region of a snapshot fl ^Pl containing the H'^ junction J between 
the 2 spacelike bulk pieces El and S_r upon which the prescribed data are smooth or analytic. Then the 
evolutions in the domains of dependence ©"""(El) and 2?^ (Eh) are smooth or analytic, but the causal future 
of J will typically be H^. 

In simple words, why should the junction remain unscathed? Why should the bulk in the 
immediate vicinity of the junction at later times be simple, smooth or known? Addressing 
these questions goes beyond the reach of present-day techniques. Rather we note that some 
usually-tacit assumptions are used to definitely avoid these difficulties, and that we do not 
know whether any of these are justifiable in the physics governed by the GR evolution 
equations. One can presuppose a thin matter sheet exists at all times to prevent it being 
created or destroyed. One can allow only bulks with regular thin matter sheet boundaries 
to preclude shock waves. One could also preclude asymmetric crashes by presupposing Z2 
symmetry. 

Even if discontinuous emissions are precluded, one can imagine smooth emissions and 
absorptions (symmetric or asymmetric) whereby the bulk interacts with the thin matter 
sheet. This would entail material leaking off or onto each side of the thin matter sheet. 
This can be precluded by the presupposition that the thin matter sheet's energy-momentum 
resides at all times on the sheet, which is encapsulated by the well-known 'equation of 
motion' of the thin matter sheet |2U51I27^ . This is a significant restriction on the dynamics 
of thin matter sheets in GR, often carried out in the name of tractability but which may 
not be realistic. 

In the Randall-Sundrum scenario, the brane can be placed anywhere in the bulk without 
affecting it. We strongly suspect that this is not a typical feature in GR-based braneworlds, 
nor is it desirable since it dangerously marginalizes the ontological status of the bulk. We 
take this feature to be too specialized to be included in the developments below. 

We close this section by briefly discussing modelling assumptions which avoid some 
of the above difficulties. We then implement these in two particular classes of tractable 
problems. The desirability of each modelling assumption strategy and of other modelling 
assumptions from a string-theoretic perspective is discussed in B.3.4. 

I) The choice of Z'^ symmetry may well look and indeed be arbitrary from a GR point 
of view and indeed from a string-theoretic point of view (see the end of this section). But 
without such a choice one simply cannot establish any specific embeddings if there are 
thin matter sheets (in the GR study of stars one requires the absence of surface layers to 
perform matchings). Thus one studies the restricted case with symmetry, in which one 
integrates only up to the b.c provided by the j.c. In the IVP part one can assume whatever 
configuration of thin matter sheets. Thus this step is more justifiable than the subsequent 
evolution, which may require some of the above ad hoc assumptions as to the existence 
and good behaviour of the thin matter sheet at all future times. So for the moment we are 
merely after the construction of (4, 0) initial data up to the junction. By themselves these 
data are useful in addressing issues such as the shape of the extensions of black hole horizons 
into the bulk, but the staticity and stability of these configurations remain unaddressed in 
the as- yet intractable evolutionary step of the problem |333j . 

II) One could choose instead to work with thick matter sheets i.e ones that are finitely 
rather than infinitely thin j57j . Then the above troubles with the function spaces need not 
occur, since the walls would then not be rough in the above sense, although they could still 



have the tractable level of roughness that is able to accommodate astrophysical objects just 
as in ordinary GR. It is then easier to envisage the study of the evolutionary step of the 
thick matter sheet problem quite high up our hierarchy, since this more closely resembles 
the stellar surface problem. 

Ill) One could investigate closed shells rather than open sheets in order not to require 
directional asymptotics. 



3.2 Thin matter sheet IVP 



We consider the jj, = case. Then by definition an extrinsic curvature Q'^"-^ = 0TTa6 
suffices in order to solve the Codazzi constraint. We begin by considering the (4, 0) case of 
the Lichnerowicz-York equation (|246j) . The j"" ^ case, while important, lies beyond the 
scope of this thesis. 

We work with the unphysical line element ds'^ = dz^ + e^dxadx" for some known trial 
function w = w{r,z), so as to model a 5^-symmetric object such as a black hole or a 
star on the brane. For the j"' = case, the IVP essentially reduces to the solution of the 
Lichnerowicz equation for some ■0 = z). We begin with the Tab = case of (the spatial 
projection of) (|742|) . We first find the b.c's for the Lichnerowicz equation, then provide 
algorithms and then comment on the underlying mathematical physics. 

Note that the b.c's are to be imposed on the physical metric and then written in terms 
of the unphysical quantities to have b.c's in terms of the objects we start off with and the 
unknown conformal factor ip. 

The inner radial b.c is commonly established using an isometry (c.f C.2.3). The Neu- 
mann reflection b.c case appropriate for a stellar source holds just as well here as in C.2.3. 
The case appropriate for a black hole is however distinct. Now, part of r = is singular 
and is approached by excision and an inversion-in-the-sphere isometry about some throat 
at radius a of the black hole within the apparent horizon [see fig 20a)]. But for braneworld 
black holes one does not know how far the black hole extends into the bulk. Indeed, the 
main point of the study is to find this out (pancakes versus cigars). Our idea is then to 
guess a profile r = f{z) along which to excise. About each point on r = f(z) an isometry 
in the corresponding 2-sphere may be applied [fig 20 b)], leading to an 
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The profile could be chosen so that it matches up smoothly with the r = and z = 1 
boundaries. Once the problem is solved, one can find out whether this profile was a good 
choice or not [fig 20 c)]. By picking a 1-parameter family of r = /(z) curves, one could then 
iterate until a satisfactory profile is found. 

For the new inner-z (z = 1) b.c, we impose the j.c's (|733() and H741|) . which now read 

^1(3 = ^a(3 ' " Kf3 = '^5 (y^/^ " ^iaf3^ (772) 

(N.B these are imposed on the physical quantities). Imposing Z2 symmetry, the second of 
these becomes 
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by use of the definition of the physical Ka/3 in normal coordinates and then factorizing out 
the CafB- This Z2 isometry b.c is more complicated than usual because of the presence of 
the thin matter sheet. 

That the Z2 symmetry is termed a reflection appears to generate a certain amount of 
confusion. This is because there is also the reflected wave notion of reflection, taken to 
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Figure 20: a) Excision of the inner region in (3, 0) black hole data construction by an inversion-in-the- 
sphere isometry identification between two copies of the black hole. This is along the lines of the method of 
images in electrostatics. 

b) Excision of the inner region in (4, 0) braneworld black hole data construction would involve making a 
guess r — f(z) for the excision region and then performing inversion-in-the-sphere isometrics pointwise to 
identify two copies of the black hole. 

c) One is then to numerically solve for the data. One can then find the apparent horizon. If this intersects 
r — f{z) (horizon 1) then the data may be bad in that the part of the shaded portion of the excised region 
may be in causal contact with its surroundings. If the apparent horizon extends too far past the end of 
the excision region (horizon 2), one might worry that the singularity also extends past the excised region 
in which case some of the data prescribed along r = should not have been prescribed since it lies on the 
singularity. 



give a Neumann b.c (such as in for the inner-radius b.c for a stellar source). In the case of 
Shiromizu and Shibata (SS) I 333j (see also |277j l. who furthermore treat our known w as 
their unknown, and our unknown ^/^ as the perfect AdS ■0 oc ^, it has the nice feature that 
the other 2 terms in H773|) cancel leaving a Neumann b.c 



dw 
dz 



= 0, (774) 

2 = 1 



which might look like a reflection in both of the above senses. However, this is only possible 
for tp oc which may be somewhat restrictive. Note also that H774() is an unusual 

notion of reflection in this context, since it is of the conformally-untransformed (3, 0)-metric, 
whereas the natural notion of reflection isometry would be for the conformally-transformed 
(4, 0)-metric. Thus we conclude that the presence of the notion of reflection (|774() in SS's 
work is happenstance: it is separate from the Z2 notion of reflection encapsulated in (|772j) 
and no analogue of it need hold in more general situations. 

Our b.c. H773|) contains a reflection part the other part being a (nonlinear) ab- 
sorption. This has the following implications. First, the presence of absorption terms is 
interesting since pure reflection is the boundary condition of a perfect insulator in the po- 
tential theory of heat. This suggests that pure reflection b.c's such as for pure AdS have 
built-in non-interaction of the bulk with its bounding brane, whereas our b.c may lead to 
(evolution models) with brane-bulk interactions. Whereas neither the planar symmetry in 
(4, 1) spacetime of the underlying thin matter sheet nor the S"^ symmetry within the (3, 
1) thin matter sheet of a simple astrophysical object generate gravity waves, we expect 
that spherical bumps on approximately planar branes in (4, 1) spacetimes to admit gravity 
waves, potentially giving rise to instabilities in the approximately AdS bulk braneworlds in 
which the branes contain astrophysical objects. Our b.c could permit the modelling of such 
interactions. Second the nonlinear absorption (much as arises in the theory of heat with 
temperature-dependent conductivity) is likely to complicate both analytic and numerical 
treatments of our b.v.p. 

The first point above is one reason to favour our approach over that of SS (subject to 
overcoming the complications due to the second point). Here is a further reason why our 



approach should be favoured on the long run. Whereas one could try to generalize SS's 
work by keeping the notion that w be unknown and known but not i, whereupon our b.c 
l|773|) is interpreted as an inhomogeneous Neumann b.c in w, the main trouble with SS's 
work is that there is no good reason for their method to be directly generalizable away from 
its restrictive assumption Qij = 0. Whereas treating w not ip as the unknown may give 
nicer b.c's, it is not tied to a method known to be amenable to less trivial solutions of the 
Codazzi equation than 6^- = 0. Our proposed method seeks a workable extension to the 
significantly more general case @Jj = Qjj^, and speculatively to completely general 

Another nice feature in SS's work is that the matter is scaled so that the linearized p.d.e 
obtained is a Poisson equation. This p.d.e is simply invertible to obtain detailed asymptotics 
[r48. 333 . For our scheme, the linearized equation (|794|) is of Helmholtz-type rather than 
Poisson, complicating such a procedure. We merely demand instead that -0 — ^ ^ for z 
large wherever possible so that our models are 'directionally asymptotically AdS'. As for 
large r, at least on the brane, one can impose asymptotic flatness as r — > oo. It is less 
clear what one should prescribe in this respect off the brane. A more detailed study of the 
asymptotics for our b.v.p's should be required as part of the actual construction of examples 
of data sets.^"'^'^ 

Our proposed b.v.p is thus the mixed, nonlinear problem 

A'i/' = Nonlin(^) , 
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or 

^ + 1 = on r = f{z) . (775) 
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One can use simplifications i) and/or ii) of C.2.2 on this and still have a more general 
case than that of SS. Simplification i) might be plausible because 5-d AdS clearly admits 
conformally-flat spatial sections [ds^ = ^{Pdz"^ +dxadx^)] or maybe not, since it is regarded 
with suspicion in the (3, 1) GR 2-body problem The trick at the end of C.2.2 cannot be 
used to obtain particularly simple examples due to the following argument. Unless one puts 
p DC A (constant) and has this maintained by non-scaling (i.e o = 2), it is overwhelmingly 
probable that the emergent A is not constant. Thus our freedom in a is used up to ensure 
that A is constant. Contrary to what York assumes in the ordinary GR context, we must 
permit /) < since our application requires a negative bulk cosmological constant, which 
further complicates the analysis of which cases are guaranteed to be well-behaved and 
renders p cancellation with m? unavailable. 

Our algorithm for solving this problem is as follows: 

i) Prescribe the following unphysical quantities: the metric hab and matter density p. 

ii) Pick any suitable O"'"^*-' to solve the Codazzi constraint. It follows that M is known. 

iii) Thus we can attempt to solve our b.v.p for the Lichnerowicz equation to obtain Tp. 

iv) Then we can compute the physical bulk metric hab and induced brane metric e^p of our 
snapshot. 

This assumes that we have correctly guessed the profile. One would now check whether 
this is the case by solving for the apparent horizon. If this is unsatisfactory [fig 20c)] then 
one would repeat the algorithm with adjusted profile. 

For nonvacuum branes {Tab 7^ 0), a similar argument enforcing a = 2 holds. The j.c 
may now be split into a trace b.c part, 

^^''Particularly because numerical integration is done on large but finite grids, subleading order asymptotics 
are helpful. 



and a restriction on the tracefree part of the matter on the brane, which is 

yip = (777) 

for the metric ansatz and coordinate choice used. Our algorithm now becomes 

i) One now requires the prescription of hac everywhere and of Y. 

ii) Pick any suitable O^Tji. hence M = Q'^ o ©'^ is known. 

iii) Attempt to solve our b.v.p for the Lichnerowicz equation. 

iv) Then we can compute hab and Cq,^. In this particular case, the simultaneous imposition 
of normal coordinates and an isotropic line element forces the restriction H777() on the 
braneworld matter. This is because one is applying too many coordinate restrictions; in 
this light the habitual practice of working in braneworlds using normal coordinates may 
be seen as a poor choice of gauge. If the above coordinate choices are not simultaneously 
made, the algorithm would contain further nontrivial equations [in place of H777|) ] to solve 
before the braneworld matter content can be deduced. 

We assume that this nonvacuum application is for a star or clump of dust on the brane 
in which case there is no need to excise a corner with a profile as was done above to deal 
with the black hole singularity. 

Notice the lack of control of the physical metrics characteristic of theoretical numerical 
relativity. Our non-scaling of the matter at least gives visible control over the matter. The 
sensibleness of doing this is in fact tied to the negativity of the bulk p: for p > and 
a > — 1, Lin receives a negative contribution with its tendency to encourage ill-posedness, 
whereas for predominantly p <0, the danger lies in a < — 1. 

A simpler example of b.v.p set up along the lines we suggest is that of Nakamura, 
Nakao and Mishima 2771 ■ Their metric is cylindrical not spherical and they have a slightly 
simpler version of (|773|) . Their case is still time-symmetric. The linearized equation is then 
precisely Helmholtz, facilitating its inversion and the consequent more detailed knowledge 
of the asymptotics. 

We also propose the 2-brane version of the above, in which the z — > oo condition 
is replaced by another 'parallel' brane boundary at z = I + I. This problem is close to 
that considered by Piran and Sorkin [339| . In contrast however, their study involves a 
periodically-identified fifth dimension to which there does correspond an isometry-based 
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We can provide a local (in function space) uniqueness proof for our proposed type of 
b.v.p, at the same level as that for standard GR black hole data in |393j . In the conformally- 
flat case, suppose that ipi = tp2 + u for u small. Then the homogeneous linearized b.v.p in 
u is applicable. We then have 

I {\Du\^ + Unu^)d^x = f {\Du\^ + uAu)d^x= / u^dS 
Jt, Jt. JdT, on 



= [ u^dS+f u^dS-[ u^dS-[ u^dS (779) 
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by Green's theorem. For u — > at least as fast as ^, the first surface integral tends to 
zero. For u — > at least as fast as ^ the second surface integral tends to zero. The 



fourth integral is zero by the Neumann reflection b.c. Upon applying the linearization of 
the nonlinear b.c (|773|) . 



du u ( dw ^ ~ , 



0, (780) 



the third surface integral is non-positive provided that the restriction ^ > holds. Thus 
since Lin > 0, u must be zero. For the non-conformally flat case, this argument requires 
R > 0. The argument is not seriously changed in the black hole case when a corner is 
excised by a profile with a Robin condition on it. 

Stronger existence and uniqueness proofs for our proposed problems are complicated 
by four factors: unboundedness, (albeit of the most benign kind), the boundary having 
corners, mixed b.c's (i.e a piecewise prescription on the boundary) and a portion of these 
being nonlinear. The first is however of the simplest kind and the second and third are 
commonplace. The last is less usual but perhaps not so bad either because by 



the problem can be written in divergence form, for which b.v.p's with nonlinear b.c's are 
treated in Ch. 10.2 of |243j . The trouble is that the treatment there, unlike here, is neither 
for mixed b.c's nor for an unbounded region with corners. Thus, the strongest point we can 
make at present is that there is good hope of obtaining existence and uniqueness results even 
for quite rough function spaces by more-or-less conventional, entirely rigorous mathematics 
for our proposed problem. This should be contrasted with the hopeless state of affairs 
with sideways prescriptions! Once one begins to get good numerical results, it becomes 
worthwhile to explicitly prove the well-posedness of the method used, in order to support 
those results and the ongoing production of more such results. In this particular case, these 
results would serve to support and understand the crucial numerical step iii) in the above 
algorithms. 

To permit the even greater generality required to have momentum flows (j" 7^ 0), by 
our route one is forced to forfeit the DEC control over the matter. This is because at 
least the A part of p cannot scale in accord with the scaling of j". We leave this further 
development, which additionally requires posing and solving the b.v.p following from the 
Codazzi equation, for a future project. 



3.3 Thick matter sheet IVP 



Many of the difficulties with evolution discussed above stem from the thinness of the matter 
sheets causing function-space-related problems. But ordinary physics should not be sensitive 
to which function space is used. One would hope that models with thick matter sheets would 
be more amenable to study, be good approximations to the thin matter sheets and in any 
case could be closer to reality than thin matter sheets. 

Bonjour, Charmousis and Gregory have used the (2, 1; 1) version of the general split 
(|148)) . (|149j) . (|150j) with scalar matter and consider both thick walls and the thin- wall limit 
perturbatively in the presence of gravity. They further specialize to the case of a spherical 
domain wall, which they show collapses. It would be worthwhile if this sort of example, 
which combines the modelling assumptions II) and III), is investigated using the 
(n, 0; -1) split subject to flat and to AdS asymptotics. 

For the particular example mentioned above, the matter profile is sigmoidal. However, 
to approximate a thin brane, one would want instead a hump profile, as indicated in fig 21. 
One could still use an inner b.c at 2; = 1, where now kij = so that the linear Robin b.c 



holds. One could in fact consider 2 problems: the flat sheet with directional asymptotics 
and the spherical sheet with a suitable inversion-in-the-sphere in place of the above inner 
b.c. Both of these problems avoid the main obstacles to existence and uniqueness proofs by 
possessing linear b.c's. The second problem is also invertible to an inner problem which is 
on a bounded region with no corners, for which results in |243j apply. 

Thin brane Thick brane 

matter 
distribution 



metric 
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Figure 21: The metric, extrinsic curvature and matter profiles for thin branes and thick branes. 

These problems are for 'plain' thick branes as opposed to branes containing spherically- 
symmetric objects. The first problem ought to more easily admit this extension. One might 
wish to study 'plain' thin branes too, e.g in the context of branes in relative motion. 

Finally, suppose instead one attempted to use something like Magaard's method to 
construct (4, 0) thin-matter-sheet data. Then one benefits from s = and the identification 
of xi = const with the z = 1 brane prescribes the topology, removes some of the sources 
of nonuniqueness and the brane's energy-momentum endows physical significance upon the 
(3, 0) covariance of the elimination procedure. In the thin matter sheet case, one is blocked 
by the nonanalyticity of p, but at least some thick matter sheet models could be built in 
this way. 



3.4 Discussion: modelling assumptions and stringy features 

We did not consider the possibility of a higher codimension which could substantially 
alter the nature of the embeddings used. For models with 2 or more times, it is simply not 
possible to stick entirely to our suggestion to build on established mathematical physics. 
Finally, our study could be complicated by having more than the Einstein tensor in the 
bulk. For example, one can have a nontrivial Lovelock tensor term j259| l8,Sj . or one could 
have genuine higher-derivative terms. Whilst our incipient results - the correspondence 
between (n, 0; -1) and (n - 1, 1; 1) schemes at the simplest level, or our BEFE ambiguity - 
will have counterparts in these theories, these theories' equations are much larger than the 
EFE's, and their CP/IVP is much less well-studied than that of GR. Higher derivatives or 
non-standard bulk matter might lead to the violation of the energy conditions assumed in 
this thesis. 

We next discuss which features of GR-based braneworld models are desirable for string 
theory. First for some caveats. We introduced the SMS braneworld as a GR generalization 
of the Randall-Sundrum scenario, moreover one that was as yet not sufficiently general for 
the purposes of GR in which the nature and existence of thin matter sheets at all times 
might not be a representative presupposition. It could be that it is other aspects of the 
Randall-Sundrum scenario that are of interest in string theory since it also has particle 
physics aspects to it and because it is a toy model of Hofava-Witten theory. One may 

^^^A starting-point for this study might be the case-by-case 4-d work of I67| . 



prefer to consider only these stringy scenarios rather than Randall-Sundrum or SMS. Also 
a great source of tension between string theorists and relativists is that the latter believe GR 
is suggestive of the importance of background-independent theories, whilst there is as yet 
no background-independent formulation of string theory available. Such ideas as conserved 
charges, flat sheets and simple bulks arise from assuming Minkowski or AdS backgrounds. 
Since GR is background-independent, one is interested there in generic solutions to the 
full field equations. This makes the notion of asymptotics necessary and subtle, and not 
necessarily compatible with flat sheets nor conserved charges. From a general perspective 
it is doubtful whether highly-specialized solutions such as Minkowski or AdS are likely to 
reflect the gravitational physics of the universe, unless it can be shown that large classes of 
generic solutions behave likewise or are attracted toward such solutions. 

String theory conventionally favours many extra dimensions, although it is not clear 
how many of these should be large and how many compactified. Among the corrections to 
the Einstein tensor mentioned above, string theorists might favour the Lovelock correction, 
since this is the only first-order correction for a heterotic string jl67j : an infinite series of 
further such correction terms is predicted. String theorists might favour very flat walls 
on the grounds that these could be stabilized by being the lowest-energy carriers of BPS 
charge \299\ I213j . These arguments would rule against the use of highly symmetric bulk 
manifolds /orbif olds, although string theorists might try to invoke them as fixed backgrounds 
so as to perturb about them. There may be different reasons for stability in GR and 
string theory. For example, in contrast to string-theoretic arguments favouring reflection- 
symmetric orbifolds, a GR domain wall could be stable precisely because the (3, 1) bulk 
is different on each side, as the wall separates domains in which symmetry is differently 
broken. On the whole, the string theorist would agree that bulks could be complicated 
by the presence of bulk matter from the closed string multiplet (1.3.3.5). At least in some 
models, gauge fields would be expected to occur only as fields confined to branes since 
they belong to the open string multiplet and some branes are where open strings end j299j . 
As strictly, the branes are of Planckian thickness, so it is not a disaster that thick branes 
are favourable toward rigorous mathematics for GR-based scenarios (this is recycling the 
argument in footnote 32). Of course, one could worry that they are thin enough for quantum 
gravitational effects to be important. 

Finally, we summarize the aspects of question 2 that should be asked as the partial 
answers to question 1 begin to be built up. Does the introduction of stringy features change 
the outcome to any of issues? And so, are GR-based braneworlds adequate or typical 
frameworks for string theory? What scenarios should one explore in order to furnish string 
theory with reasonably unique predictions? 



C Appendix on elliptic equations 



Let u take 1 to U, v take 1 to y and w take 1 to W. Let Q he a multi-index running over 
the integers wi to low, = Y.w=i Vq = g(^i)£i^,','a(^^)u.w ■ The system of U p.d.e's 

for V unknowns y'" of W variables (x"") 

A^^ix^)Vny'' = (783) 

n=o 



is elliptic if its principal symbol cr{kd) = A^"(x^)A:q 

(784) 

is positive-definite and invertible. 

Many elliptic equations are associated with the GR IVP. Below I build up to these 
starting from the simplest elliptic equations, which emerge anyway as special cases in the 
GR IVP. The purpose of using elliptic formulations is that they lend themselves well to 
general well-posedness proofs. But one may need to bear in mind in practice that they are 
computationally expensive (since, being instantaneous equations, their solution involves the 
whole numerical grid). 

1 Linear elliptic equations 

The Poisson equation in flat space (e.g Newtonian gravity or electrostatics) Ap^ = B 
is typically considered in a region 17 and supplemented by b.c's on d^l. Typical b.c choices 
include 

Dirichlet : A\g^ = K , (785) 



9A 

JNeumann : -— 
an 



= L for n the normal to dVl , (786) 

an 



Robin : [^ + fA 



, dA 
mixed : --— 
an 



, =M , (787) 

5o / an 

L, A\g^ =M,dn = dniUdn2- (788) 



These easily have good existence and uniqueness properties via Green's theorems It is prof- 
itable also to prove uniqueness for these from the maximum principle (see e.g |92| ) . a method 
which affords substantial generalization. Whereas the above problems are furthermore well- 
posed, the Cauchy data choice 



dA 
dn 



= L , (789) 



includes Hadamard's example, which fails to exhibit continuous dependence on the data. 
Solution methods for the b.v.p's H785H78^ include fundamental solution techniques and the 
method of images. 

The above treatment can straightforwardly be extended to (generalized inhomogeneous 
Helmholtz) equations of form 

{M'^didj + Q'di + S)A = B , M'^ positive definite and invertible . (790) 



A case of this is the following curved space equation: 



{h'^didj - r''dk)N = AN = NO + P , (791) 

where h^^ is regarded as some inverse n-space metric and F'^ = K'^V'^ij. These typically 
appear in this thesis for the CWB case. Then, the above types of proof benefit from the 
additional fact that all surface integrals vanish: 

= / D.NdS' = [ dOAiV = / dn{NR + 0) ^^N if NR + is of fixed sign. (792) 
Jan J J 

For the last step, suppose there is some point xq at which the integrand /(xq) = e > 0. 
Then for / continuous, |/(x) — /(xo)| < | V |x — xq] < 6, so I{x) > | V |x — xo| < S, so 
J d'^xl{x) > K6^e > 0, where K is some positive constant, which is a contradiction. 

The above nonexistence is the integral inconsistency of in.2. A well-known case is non- 
maintainability of maximal slicing in CWB GR, tied to the LFE A^A^ = NR. This works 
out because R is positive-definite from Ti. and the lapse N is strictly positive by definition, 
so the integrand is positive and hence cannot vanish. 

This is bypassed in GR by using CMC slicing instead. Then the LFE is rather 
AA = A (^R + + C(A), and C(A) may be taken to be negative to ruin the above 
argument. This is also the means by which CS+V theory and Newton- York absolute time 
theory work. Conformal gravity and Kelleher's theory work instead by having LFE's of 
a special integro-differential form A^A = NQ = NQ— < NQ > arising from adopting 
volume-divided actions. Then as J d^lNQ is trivially 0, the integral inconsistency becomes 
irrelevant by construction. Finally, note that if both tricks are applied simultaneously, 
C = so the CMC resolution is not then possible (this is why volume-divided Newton- 
York absolute time theory is just Kelleher's theory again). 



2 Lichnerowicz— York equation 



The arbitrary-dimensional (but most definitely s = 0!) Lichnerowicz- York equation is a 
nonlinear elliptic p.d.e 

Alp = Nonlin(V') = -cipiR - Mi;" + m^'' - 2^^") , 



arbitrary = 



(793) 



n - 2 ' ^ (1 - n)6 ' i ^ i V ' 

Note that the nonlinearity is of the relatively mild form known as quasilinearity |243j since 
I^VP does not appear (due to the artful construction on p 47). A number of theorems apply 
to equations of this form. 



2.1 Linearization 



First linearize it by setting tp = ^po + eipi, p = po + epi in ()793() and equate the 0(e) terms 
to obtain 

[A - Lin(^o)]V'i = Inh(Vo) , 



Lin(^o) = -c[R - (1 + c)M^l + m2(l + 6)^|j - 2(1 + a)piPl\ , Inh(Vo) = 2cpi^l+^ ■ (^94) 

If asymptotically hab — > h^i, and ■00 — *■ i^o-, one obtains [A^ — Lin(?/;|^)]0i = lnh(0^) 
from which the asymptotic behaviour of ip for the full equation may be obtained. 

NB this is an equation of the form (|79()|) . Many b.v.p's for such equations are well- 
studied inn I3U41 1243j . To give an idea of the techniques available, consider the Dirichlet 



problem: (|79Uj) in S together with b.c ipi = f on 9S. One then proves existence from 
uniqueness 243 , by setting up a sequence of problems interpolating between the flat-space 
Laplace equation and (|79Ujl .^''^^ 



2.2 The full equation and its simplifications 



One can then interpolate between quasilinear elliptic equations such as the full Lichnerowicz 
equation and their linearizations. Then one has topological theorems about the fixed points 
of suitable maps, such as Leray-Schauder degree theory |243j . to prove existence of the 
solutions to the quasilinear elliptic equation's Dirichlet problem. Similar techniques are 
available for further b.v.p's j243j . 

Thus in handling the Lichnerowicz equation, one has an enhanced chance of having a 
well-behaved b.v.p if the corresponding Lin > 0. To this end, notice that M and m? are 
always positive whereas the troublesome R is of variable sign. 

One may furthermore try three simplifications: 

i) Conformal flatness, which replaces A by the flat-space Ap and wipes out the arbitrary- 
sign R term. 

ii) Maximality: m = removes the highest-order term of Nonlin(i/)). 

iii) Setting M = 0. If this is combined with ii), one has moment of time symmetry Kij = 0, 
which is regarded as overly simple in the standard applications of numerical relativity as it 
corresponds to the absence of gravitational momentum on the slice S. 

One can sometimes also use a trick: to scale p like M, R or and then fix it so as to 
cancel with this term. 

In the j'^ 7^ case, the cancellation trick above is not available if one wishes the procedure 
to respect the DEC. Simplification i) is convenient in this case. 



2.3 Boundary conditions 



I consider these here for 3-d single spherical compact object data, in support of the braneworld 
black hole application in B.3. 

The inner radial b.c, is commonly established using an isometry. For example, for a 



stellar source, one could use hab{xa) = hah{—Xa) 



Xa=0 







dr 



leading 



component-by-component to the homogeneous Neumann b.c (the so-called reflection b.c) 



dr 



0, 



(795) 



and the restriction ^ 



on the valid form of the known function. 
For a black hole, r = is singular. One excises this by use of an inversion-in-the-sphere 
isometry about some throat at radius a of the black hole within the apparent horizon [see 
fig 20a)]. This gives an inner Robin condition 

di; 1 
dr 



0. 



(796) 



Asymptotic flatness provides a natural outer radius (r — > oo) Dirichlet b.c. Would 
like subleading terms because of working on a finite grid in practice. Sophisticated analysis 
might rather use a mode-selecting outer Robin condition. 

^^■^A word of caution as regards the applicability of this to 1)791^ equations: as /i*-' and V have geometrical 
significance, (179 H equation is on an underlying n-manifold E (generally with boundary). I thus caution that 
if E cannot be deformed to fiat space, the interpolation methods between the Laplacian on a portion of 
and our operator cannot apply. To avoid this trouble, one could study a sufficiently small region of S 
(as E is a manifold and hence locally 5R" or restrict the allowed topology of E to be the same as that of a 
portion of 5R" . 

^^■^One has no choice but to work with apparent horizons rather than event horizons if one is given a single 
spatial slice. At least the apparent horizon lies within the event horizon in GR. 



3 Techniques for solving the momentum constraint 



The moment of time symmetry simplification Kij = means no momentum constraint need 
be considered. This alongside conformal flatness and vacuum assumptions gives the simplest 
conformal method IVP: the remaining Lichnerowicz equation reduces to a flat-space Laplace 
equation, while its electrovacuum counterpart is two Laplace equations (the above and the 
electrostatic one). But moment of time symmetry corresponds to absence of gravitational 
momentum. More general data are thus desirable. In order to have a nonzero j'', one 
furthermore requires Kij to have a longitudinal part. The simplest longitudinal method is 
when the metric is conformally-flat. Then the 'Hodge decomposition' Wi = Vi + diU of the 
vector potential reduces the momentum constraint to n + 1 linear scalar Poisson 

fi — 2 

equations [223 ApU = -— TzdjV^ , ApVi = -eji . 

(797) 

which are what is solved for e.g Bowen-York data. 

The full momentum constraint is a well behaved vector elliptic equation. Recollect that 
it is decoupled from the Lichnerowicz equation by virtue of the CMC slicing. The conformal 
thin sandwich method requires all the constraints and the LFE to be treated together as a 
system. In both these cases the momentum constraint's good elliptic properties help ensure 
the whole system works out. 

In contrast to these fortunately elliptic formulations for the momentum constraint, its 
traditional sandwich counterpart (|229j) is only elliptic in small regions, so the traditional 
thin sandwich is not thereby furbished with powerful theorems. The Einstein-Maxwell 
extension of this system (including coupling to the electromagnetic Gauss constraint) is 
interesting, since the overall symbol becomes somewhat different jl57j . 

I note that my 'Codazzi equation' (|258|) is more complicated than usual. I do not expect 
this equation will be more than elliptic in small regions. The involvement of various values 
of W in theories in this thesis will directly affect the ellipticity properties of their thin 
sandwich formulations. That this alters the nature of the principal symbol may be easily 
inferred from ^.S92j . 
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Useful list of acronyms 

AF arbitrary frame 

BEFE braneworld Einstein field equation 

BM best matcliing, or best matclied 

CS conformal Superspace 

CMC constant mean curvature 

CWB compact without boundary 

DEC dominant energy condition 

DOD domain of dependence 

ELE Euler-Lagrange equations 

IDV "independent dynamical variable" 

LFE lapse-fixing equation 

POE principle of equivalence 

POT Problem of time 

RCS relative configuration space 

RI reparameterization invariant 

RP relativity principle 

RWR relativity without relativity 

SSF split spacetime framework 

THEFE timelike hypersurface Einstein field equation 

TSA 3-space approach 

WDE Wheeler-DeWitt equation 



